NON-ENDOGENOUS, CONSTITUTIVELY ACTIVATED
HUMAN SEROTONIN RECEPTORS AND SMALL
MOLECULE MODULATORS THEREOF

The benefit of U.S. Serial Number 09/060,188, filed April 14, 1998 (owned by
Arena Pharmaceuticals, Inc.), U.S. Provisional Number 60/090,783, filed June 26, 1998
(owned by Arena Pharmaceuticals), U.S. Provisional Number 60/112,909, filed December
18, 1998, and U.S. Provisional Number 60/123,000, filed March 5, 1999, is hereby claimed.

FIELD OF THE INVENTION

The present invention relates to non-endogenous, constitutively active serotonin 
receptors and small molecule modulators thereof. 

BACKGROUND OF THE INVENTION 

I. G protein-coupled receptors 

G protein-coupled receptors share a common structural motif. All these receptors have
seven sequences of between 22 and 24 hydrophobic amino acids that form seven alpha helices,
each of which spans the membrane. The transmembrane helices are joined by strands of amino
acids, with a larger loop between the fourth and fifth transmembrane helices on the
extracellular side of the membrane. Another larger loop, composed primarily of hydrophilic
amino acids, joins transmembrane helices five and six on the intracellular side of the
membrane. The carboxy terminus of the receptor lies intracellularly, with the amino terminus in
the extracellular space. It is thought that the loop joining helices five and six, as well as the
carboxy terminus, interacts with the G protein. Currently, Gq, Gs, Gi, and Go are G proteins
that have been identified. The general structure of G protein-coupled receptors is shown in
Figure 1.

Under physiological conditions, G protein-coupled receptors exist in the cell membrane
in equilibrium between two different states or conformations: an "inactive" state and an
"active" state. As shown schematically in Figure 2, a receptor in an inactive state is unable to
link to the intracellular transduction pathway to produce a biological response. Changing the
receptor conformation to the active state allows linkage to the transduction pathway and
produces a biological response.

A receptor may be stabilized in an active state by an endogenous ligand or an
exogenous agonist ligand. Recent discoveries, including but not limited to
modifications to the amino acid sequence of the receptor, provide means other than ligands to
stabilize the active state conformation. These means effectively stabilize the receptor in an
active state by simulating the effect of a ligand binding to the receptor. Stabilization by such
ligand-independent means is termed "constitutive receptor activation."
II. Serotonin receptors 

Receptors for serotonin (5-hydroxytryptamine, 5-HT) are an important class of G
protein-coupled receptors. Serotonin is thought to play a role in processes related to learning
and memory, sleep, thermoregulation, mood, motor activity, pain, sexual and aggressive
behaviors, appetite, neurodegenerative regulation, and biological rhythms. Not surprisingly,
serotonin is linked to pathophysiological conditions such as anxiety, depression,
obsessive-compulsive disorders, schizophrenia, suicide, autism, migraine, emesis, alcoholism and
neurodegenerative disorders.

Serotonin receptors are divided into seven subfamilies, referred to as 5-HT1 through
5-HT7, inclusive. These subfamilies are further divided into subtypes. For example, the 5-HT2
subfamily is divided into three receptor subtypes: 5-HT2A, 5-HT2B, and 5-HT2C. The human
5-HT2C receptor was first isolated and cloned in 1987, and the human 5-HT2A receptor was
first isolated and cloned in 1990. These two receptors are thought to be the site of action of
hallucinogenic drugs. Additionally, antagonists to the 5-HT2A and 5-HT2C receptors are
believed to be useful in treating depression, anxiety, psychosis and eating disorders.

U.S. Patent Number 4,985,352 describes the isolation, characterization, and expression
of a functional cDNA clone encoding the entire human 5-HT1C receptor (now known as the
5-HT2C receptor). U.S. Patent Number 5,661,024 describes the isolation, characterization,
and expression of a functional cDNA clone encoding the entire human 5-HT2A receptor.

Mutations of the endogenous forms of the rat 5-HT2A and rat 5-HT2C receptors have
been reported to lead to constitutive activation of these receptors (5-HT2A: Casey, C. et al.
(1996) Society for Neuroscience Abstracts, 22:699.10, hereinafter "Casey"; 5-HT2C:
Herrick-Davis, K., and Teitler, M. (1996) Society for Neuroscience Abstracts, 22:699.18,
hereinafter "Herrick-Davis 1"; and Herrick-Davis, K. et al. (1997) J. Neurochemistry 69(3): 1138,
hereinafter "Herrick-Davis 2"). Casey describes mutations of the cysteine residue at position
322 of the rat 5-HT2A receptor to lysine (C322K), glutamine (C322Q) and arginine (C322R),
which reportedly led to constitutive activation. Herrick-Davis 1 and Herrick-Davis 2 describe
mutations of the serine residue at position 312 of the rat 5-HT2C receptor to phenylalanine
(S312F) and lysine (S312K), which reportedly led to constitutive activation.
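
By way of illustration only, and not limitation, the point-mutation names used above (for example, C322K) can be read as "wild-type residue, position, replacement residue." The following Python sketch parses such names. The function names and the short test sequence are hypothetical and are not taken from the cited references or from this disclosure.

```python
import re
from typing import NamedTuple

# One-letter codes for the residues named in the mutations cited above.
ONE_LETTER = {
    "C": "cysteine", "F": "phenylalanine", "K": "lysine",
    "Q": "glutamine", "R": "arginine", "S": "serine",
}


class PointMutation(NamedTuple):
    wild_type: str  # residue in the endogenous receptor
    position: int   # 1-based residue number
    mutant: str     # residue in the mutated receptor


def parse_mutation(name: str) -> PointMutation:
    """Split a name such as 'C322K' into its three parts."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", name)
    if match is None:
        raise ValueError(f"not a point-mutation name: {name!r}")
    return PointMutation(match.group(1), int(match.group(2)), match.group(3))


def describe(name: str) -> str:
    """Spell out a mutation name in words."""
    m = parse_mutation(name)
    return (f"{ONE_LETTER[m.wild_type]} at position {m.position} "
            f"replaced by {ONE_LETTER[m.mutant]}")


def apply_mutation(sequence: str, name: str) -> str:
    """Return a copy of a protein sequence carrying the named mutation."""
    m = parse_mutation(name)
    index = m.position - 1
    if sequence[index] != m.wild_type:
        raise ValueError(
            f"expected {m.wild_type} at {m.position}, found {sequence[index]}")
    return sequence[:index] + m.mutant + sequence[index + 1:]
```

For instance, `describe("S312F")` spells out the rat 5-HT2C mutation reported by Herrick-Davis, and `apply_mutation` refuses to act when the stated wild-type residue is not found at the stated position, which guards against numbering mismatches between species.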



SUMMARY OF THE INVENTION 

The present invention relates to non-endogenous, constitutively activated forms of
the human 5-HT2A and human 5-HT2C receptors and various uses of such receptors.
Further disclosed are small molecule modulators of these receptors. Most preferably, these
modulators have inverse agonist characteristics at the receptor.

More specifically, the present invention discloses nucleic acid molecules and the
proteins for three non-endogenous, constitutively activated human serotonin receptors, referred
to herein as AP-1, AP-3, and AP-4. The AP-1 receptor is a constitutively active form of the
human 5-HT2C receptor created by an S310K point mutation. The AP-3 receptor is a
constitutively active form of the human 5-HT2A receptor whereby the intracellular loop 3
(IC3) portion and the cytoplasmic-tail portion of the endogenous human 5-HT2A receptor have
been replaced with the IC3 portion and the cytoplasmic-tail portion of the human 5-HT2C
receptor. The AP-4 receptor is a constitutively active form of the human 5-HT2A receptor
whereby (1) the region of the intracellular third loop between the proline of the transmembrane
5 region (TM5) and the proline of TM6 of the endogenous human 5-HT2A receptor has been
replaced with the corresponding region of the human 5-HT2C receptor (including a S310K
point mutation); and (2) the cytoplasmic-tail portion of the endogenous human 5-HT2A
receptor has been replaced with the cytoplasmic-tail portion of the endogenous human 5-HT2C
receptor.

The invention also provides assays that may be used to directly identify candidate
compounds as agonists, partial agonists or inverse agonists to non-endogenous, constitutively
activated human serotonin receptors; such candidate compounds can then be utilized in
pharmaceutical composition(s) for treatment of diseases and disorders which are related to the
human 5-HT2A and/or human 5-HT2C receptors.

These and other aspects of the invention disclosed herein will be set forth in greater 
detail as the patent disclosure proceeds. 
BRIEF DESCRIPTION OF THE DRAWINGS

In the following figures, bold typeface indicates the location of the mutation in the
non-endogenous, constitutively activated receptor relative to the corresponding endogenous
receptor.

Figure 1 shows a generalized structure of a G protein-coupled receptor with the
numbers assigned to the transmembrane helices, the intracellular loops, and the extracellular
loops.

Figure 2 schematically shows the active and inactive states for a typical G
protein-coupled receptor and the linkage of the active state to the second messenger transduction
pathway.

Figure 3a provides the nucleic acid sequence of the endogenous human 5-HT2A 
receptor (SEQ.ID.NO: 24). 

Figure 3b provides the corresponding amino acid sequence of the endogenous human
5-HT2A receptor (SEQ.ID.NO: 25).

Figure 4a provides the nucleic acid sequence of the endogenous human 5-HT2C 
receptor (SEQ.ID.NO: 26). 

Figure 4b provides the corresponding amino acid sequence of the endogenous human 
5-HT2C receptor (SEQ.ID.NO: 27). 

Figure 5a provides the nucleic acid sequence of a constitutively active form of the
human 5-HT2C receptor ("AP-1 cDNA" - SEQ.ID.NO: 28).

Figure 5b provides the corresponding amino acid sequence of the AP-1 cDNA
("AP-1" - SEQ.ID.NO: 29).

Figure 6a provides the nucleic acid sequence of a constitutively active form of the
human 5-HT2A receptor whereby the IC3 portion and the cytoplasmic-tail portion of the
endogenous 5-HT2A receptor have been replaced with the IC3 portion and the cytoplasmic-tail
portion of the human 5-HT2C receptor ("AP-3 cDNA" - SEQ.ID.NO: 30).

Figure 6b provides the corresponding amino acid sequence of the AP-3 cDNA
("AP-3" - SEQ.ID.NO: 31).

Figure 6c provides a schematic representation of AP-3, where the dashed lines
represent the portion obtained from the human 5-HT2C receptor.

Figure 7a provides the nucleic acid sequence of a constitutively active form of the
human 5-HT2A receptor whereby (1) the region between the proline of TM5 and the
proline of TM6 of the endogenous human 5-HT2A receptor has been replaced with the
corresponding region of the human 5-HT2C receptor (including a S310K point mutation); and
(2) the cytoplasmic-tail portion of the endogenous 5-HT2A receptor has been replaced with the
cytoplasmic-tail portion of the endogenous human 5-HT2C receptor ("AP-4 cDNA" -
SEQ.ID.NO: 32).

Figure 7b provides the corresponding amino acid sequence of the AP-4 cDNA
("AP-4" - SEQ.ID.NO: 33).

Figure 7c provides a schematic representation of the mutated 5-HT2A receptor of
Figure 7b, where the dashed lines represent the portion obtained from the human 5-HT2C
receptor.

Figure 8 is a representation of the preferred vector, pCMV, used herein. 

Figure 9 is a diagram illustrating (1) enhanced [35S]GTPγS binding to membranes
prepared from COS cells expressing the endogenous human 5-HT2C receptor in response to
serotonin, and (2) inhibition by mianserin using wheatgerm agglutinin scintillation proximity
beads. The concentration of [35S]GTPγS was held constant at 0.3 nM, and the concentration of
GDP was held at 1 µM. The concentration of the membrane protein was 12.5 µg.

Figure 10 is a diagram showing serotonin stimulation of [35S]GTPγS binding to
membranes expressing AP-1 receptors in 293T cells and the inhibition by 30 µM mianserin on
Wallac™ scintistrips.

Figure 11 is a diagram showing the effects of protein concentration on [35S]GTPγS
binding in membranes prepared from 293T cells transfected with the endogenous human
5-HT2C receptors and AP-1 receptors compared to cells transfected with the control vector
(pCMV) alone in the absence (A) and presence (B) of 10 µM serotonin. The concentration of
radiolabeled [35S]GTPγS was held constant at 0.3 nM, and the GDP concentration was held
constant at 1 µM. The assay was performed in 96-well format on Wallac™ scintistrips.

Figure 12 provides bar-graph comparisons of inositol trisphosphate ("IP3")
production between the endogenous human 5-HT2A receptor and AP-2, a mutated form of
the receptor.

Figure 13 provides bar-graph comparisons of inositol trisphosphate ("IP3")
production between the endogenous human 5-HT2A receptor and AP-4, a mutated form of
the receptor.

Figure 14 provides bar graph comparisons of IP3 production between the endogenous 
human 5-HT2A receptor and AP-3, a mutated form of the receptor. 

Figure 15 provides bar-graph comparisons of IP3 production between the endogenous
human 5-HT2C receptor and AP-1.

Figures 16A-C provide representative autoradiograms showing displacement of
125I-LSD from brain sections by spiperone and compound 116100.

Figure 17 shows in vivo response of animals to 116102 exposure.

DEFINITIONS

The scientific literature that has evolved around receptors has adopted a number of
terms to refer to ligands having various effects on receptors. For clarity and consistency, the
following definitions will be used throughout this patent document. To the extent that these
definitions conflict with other definitions for these terms, the following definitions shall
control.

AGONISTS shall mean moieties that activate the intracellular response when they 
bind to the receptor, or enhance GTP binding to membranes. 

AMINO ACID ABBREVIATIONS used herein are set out in Table 1:

TABLE 1

ALANINE          ALA    A
ARGININE         ARG    R
ASPARAGINE       ASN    N
ASPARTIC ACID    ASP    D
CYSTEINE         CYS    C
GLUTAMIC ACID    GLU    E
GLUTAMINE        GLN    Q
GLYCINE          GLY    G
HISTIDINE        HIS    H
ISOLEUCINE       ILE    I
LEUCINE          LEU    L
LYSINE           LYS    K
METHIONINE       MET    M
PHENYLALANINE    PHE    F
PROLINE          PRO    P
SERINE           SER    S
THREONINE        THR    T
TRYPTOPHAN       TRP    W
TYROSINE         TYR    Y
VALINE           VAL    V



PARTIAL AGONISTS shall mean moieties which activate the intracellular response
when they bind to the receptor to a lesser degree/extent than do agonists, or enhance GTP
binding to membranes to a lesser degree/extent than do agonists.

ANTAGONISTS shall mean moieties that competitively bind to the receptor at the
same site as the agonists but which do not activate the intracellular response initiated by the
active form of the receptor, and can thereby inhibit the intracellular responses by agonists or
partial agonists. ANTAGONISTS do not diminish the baseline intracellular response in the
absence of an agonist or partial agonist.

CANDIDATE COMPOUND shall mean a molecule (for example, and not limitation,
a chemical compound) which is amenable to a screening technique.

COMPOUND EFFICACY shall mean a measurement of the ability of a compound to 
inhibit or stimulate receptor functionality, as opposed to receptor binding affinity. 

CONSTITUTIVELY ACTIVATED RECEPTOR shall mean a receptor subject to
constitutive receptor activation.

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a 
receptor in the active state by means other than binding of the receptor with its endogenous 
ligand or a chemical equivalent thereof. 

CONTACT or CONTACTING shall mean bringing at least two moieties together,
whether in an in vitro system or an in vivo system.

ENDOGENOUS shall mean a material that a mammal naturally produces. 
ENDOGENOUS in reference to, for example and not limitation, the term "receptor" shall 
mean that which is naturally produced by a mammal (for example, and not limitation, a 
human) or a virus. 

In contrast, the term NON-ENDOGENOUS in this context shall mean that which is
not naturally produced by a mammal (for example, and not limitation, a human) or a virus. For
example, and not limitation, a receptor which is not constitutively active in its endogenous
form, but when manipulated becomes constitutively active, is most preferably referred to
herein as a "non-endogenous, constitutively activated receptor." Both terms can be utilized to
describe both "in vivo" and "in vitro" systems. For example, and not limitation, in a
screening approach, the endogenous or non-endogenous receptor may be in reference to an in
vitro screening system. As a further example and not limitation, where the genome of a
mammal has been manipulated to include a non-endogenous constitutively activated receptor,
screening of a candidate compound by means of an in vivo system is viable.

INHIBIT or INHIBITING, in relationship to the term "response," shall mean that a
response is decreased or prevented in the presence of a compound as opposed to in the absence
of the compound.

INVERSE AGONISTS shall mean moieties that bind to the endogenous form of the
receptor or to the constitutively activated form of the receptor, and which inhibit the baseline
intracellular response initiated by the active form of the receptor below the normal base level
of activity which is observed in the absence of agonists or partial agonists, or decrease GTP
binding to membranes. Preferably, the baseline intracellular response is inhibited in the
presence of the inverse agonist by at least 30%, more preferably by at least 50%, and most
preferably by at least 75%, as compared with the baseline response in the absence of the
inverse agonist.
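
For purposes of illustration, and not limitation, the 30%, 50% and 75% thresholds recited above can be expressed as a simple calculation. The following Python sketch assumes that the baseline response and the response in the presence of a compound have been measured in the same units; the function names and tier labels are hypothetical and are introduced only for this example.

```python
def percent_inhibition(baseline: float, with_compound: float) -> float:
    """Percent decrease of the response relative to the baseline response."""
    if baseline <= 0:
        raise ValueError("baseline response must be positive")
    return 100.0 * (baseline - with_compound) / baseline


def preference_tier(inhibition_pct: float) -> str:
    """Map a percent inhibition onto the tiers recited in the definition."""
    if inhibition_pct >= 75:
        return "most preferred"
    if inhibition_pct >= 50:
        return "more preferred"
    if inhibition_pct >= 30:
        return "preferred"
    return "below preferred threshold"
```

Under these assumptions, a compound that lowers a baseline signal of 200 units to 80 units gives 60% inhibition and falls in the "more preferred" tier.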

LIGAND shall mean an endogenous, naturally occurring molecule specific for an
endogenous, naturally occurring receptor.

PHARMACEUTICAL COMPOSITION shall mean a composition comprising at
least one active ingredient, whereby the composition is amenable to investigation for a
specified, efficacious outcome in a mammal (for example, and not limitation, a human). Those
of ordinary skill in the art will understand and appreciate the techniques appropriate for
determining whether an active ingredient has a desired efficacious outcome based upon the
needs of the artisan.

STIMULATE or STIMULATING, in relationship to the term "response," shall mean
that a response is increased in the presence of a compound as opposed to in the absence of the
compound.

DETAILED DESCRIPTION
I. Particularly preferred mutations

For convenience, the sequence information regarding the non-endogenous,
constitutively active human 5-HT2A and 5-HT2C receptors is referred to by identifiers as set
forth in Table 2:



TABLE 2

IDENTIFIER    RECEPTOR    SEQ.ID.NO:    FIGURE
AP-1 cDNA     5-HT2C      28            5a
AP-1          5-HT2C      29            5b
AP-3 cDNA     5-HT2A      30            6a
AP-3          5-HT2A      31            6b
AP-4 cDNA     5-HT2A      32            7a
AP-4          5-HT2A      33            7b



As will be discussed in greater detail below, a mutation analogous to that reported by Casey
(C322K) was utilized in the human 5-HT2A receptor and is referred to herein as AP-2.
However, AP-2 did not lead to sufficient constitutive activation to allow for utilization in
screening techniques.
II. Introduction 

While it is sometimes possible to make predictions as to the effect of nucleic acid
manipulation from one species to another, this is not always the case. The results reported by
Casey suggest that a point mutation in the rat 5-HT2A receptor evidences constitutive
activation of the mutated receptor. Casey reports that the C322K mutation was approximately
four fold more active than the native rat 5-HT2A receptor. However, for purposes of a most
preferred use, i.e., screening of candidate compounds, this corresponding mutation in the
human 5-HT2A receptor had little discernible effect in evidencing constitutive activation of
the human receptor. This, of course, supports the reasonable conclusion that the information
reported in Herrick-Davis 1 or Herrick-Davis 2 is of limited predictive value relative to the
manipulation of the human 5-HT2C receptor. Consequently, it is not possible to make reasonable
predictions about the effects of mutations to the rat 5-HT receptors vis-a-vis the corresponding
human receptors. Nonetheless, this unfortunate lack of reasonable predictability
provides the opportunity for others to discover mutations to the human 5-HT receptors that
provide evidence of constitutive activation.

Therefore, the present invention is based upon the desire to define mutated
sequences of the human serotonin receptors 5-HT2A and 5-HT2C whereby such mutated
versions of the expressed receptor are constitutively active. These constitutively active
receptors allow for, inter alia, screening candidate compounds.

What has been discovered and disclosed herein is that substantial activation of the
human 5-HT2A receptor can be obtained by "domain swapping," i.e., by switching the third
intracellular domain of the 5-HT2A receptor with the third intracellular domain of the
5-HT2C receptor. Additionally, swapping the cytoplasmic tail of the two receptors further
increases the IP3 response. Furthermore, mutation of the serine at position 310 to lysine
(S310K) of the human 5-HT2C receptor leads to constitutive activation.

What follows is a most preferred approach to identification of candidate compounds; 
those in the art will readily appreciate that the particular order of screening approaches, 
and/or whether or not to utilize certain of these approaches, is a matter of choice. Thus, the 
order presented below, set for presentational efficiency and for indication of the most 
preferred approach utilized in screening candidate compounds, is not intended, nor is to be 
construed, as a limitation on the disclosure, or any claims to follow. 
III. Generic G Protein-Coupled Receptor screening assay techniques 

When a G protein-coupled receptor becomes constitutively active, it binds to a G protein (Gq,
Gs, Gi, Go) and stimulates the binding of GTP to the G protein. The G protein then acts as a
GTPase and slowly hydrolyzes the GTP to GDP, whereby the receptor, under normal
conditions, becomes deactivated. However, constitutively activated receptors continue to
exchange GDP for GTP. A non-hydrolyzable analog of GTP, [35S]GTPγS, can be used to
monitor enhanced binding to membranes which express constitutively activated receptors. It is
reported that [35S]GTPγS can be used to monitor G protein coupling to membranes in the
absence and presence of ligand. An example of this monitoring, among other examples
well-known and available to those in the art, was reported by Traynor and Nahorski in 1995. The
preferred use of this assay system is for initial screening of candidate compounds because the
system is generically applicable to all G protein-coupled receptors regardless of the particular
G protein that interacts with the intracellular domain of the receptor.
IV. Confirmation of G Protein-Coupled Receptor site screening assay techniques

Once candidate compounds are identified using the "generic" G protein-coupled
receptor assay (i.e., an assay to select compounds that are agonists, partial agonists, or inverse
agonists), further screening to confirm that the compounds have interacted at the receptor site
is preferred. For example, a compound identified by the "generic" assay may not bind to the
receptor, but may instead merely "uncouple" the G protein from the intracellular domain.
Thus, by further screening those candidate compounds which have been identified using a
"generic" assay in an agonist and/or antagonist competitive binding assay, further refinement
in the selection process is provided.

Lysergic acid diethylamide (LSD) is a well-known agonist of the 5-HT2A and 5-HT2C 
receptors, while mesulergine is a well-known antagonist to the 5-HT2C receptor. Accordingly, 
in most preferred embodiments, an agonist (LSD) and/or antagonist (mesulergine) competitive 
binding assay(s) is used to further screen those compounds selected from the "generic" assay 
for confirmation of serotonin receptor binding. 

V. Specified G Protein assay techniques 

The art-accepted physiologically mediated pathway for the human 5-HT2A and
5-HT2C receptors is via Gq. Intracellular accumulation of IP3 can be used to confirm
constitutive activation of these types of Gq coupled receptors (see Herrick-Davis 1). As a
result, "IP3 accumulation" assays can be used to further screen those compounds selected from
an agonist and/or antagonist competitive binding assay.

VI. Pharmaceutical compositions 

Candidate compounds selected for further development can be formulated into
pharmaceutical compositions using techniques well known to those in the art. Suitable
pharmaceutically-acceptable carriers are available to those in the art; for example, see
Remington's Pharmaceutical Sciences, 16th Edition, 1980, Mack Publishing Co. (Osol et al.,
eds.).

EXAMPLES 

The following examples are presented for purposes of elucidation, and not
limitation, of the present invention. While specific nucleic acid and amino acid sequences
are disclosed herein, those of ordinary skill in the art are credited with the ability to make
minor modifications to these sequences while achieving the same or substantially similar
results reported below. It is intended that equivalent, non-endogenous, constitutively
activated human serotonin receptor sequences having eighty-five percent (85%) homology,
more preferably having ninety percent (90%) homology, and most preferably having
ninety-five percent (95%) homology to the disclosed and claimed sequences all fall within the
scope of any claims appended hereto.
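
By way of example only, the percentage thresholds recited above can be related to a pair of sequences as sketched below in Python. This disclosure does not state how homology is to be computed; the sketch assumes, for illustration, that homology is taken as percent identity over two sequences that have already been aligned to equal length, with "-" marking gaps. The function names, tier labels and test sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue (gaps never match)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def homology_tier(identity_pct: float) -> str:
    """Map a percent identity onto the 85% / 90% / 95% tiers recited above."""
    if identity_pct >= 95:
        return "most preferred"
    if identity_pct >= 90:
        return "more preferred"
    if identity_pct >= 85:
        return "within stated range"
    return "outside stated range"
```

Other measures of homology (for example, similarity scoring with a substitution matrix) would give different percentages; the identity measure is used here only because it is the simplest to state.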
Example 1



Generation of Non-Endogenous, Constitutively Activated 
Human Serotonin Receptors 5-HT2C and 5-HT2A 

A. Construction of constitutively active 5-HT2C receptor cDNA 
1. Endogenous Human 5-HT2C 

The cDNA encoding endogenous human 5-HT2C receptor was obtained from
human brain poly-A+ RNA by RT-PCR. The 5' and 3' primers were derived from the 5'
and 3' untranslated regions and contained the following sequences:

5'-GACCTCGAGGTTGCTTAAGACTGAAGCA-3' (SEQ.ID.NO:1)
5'-ATTTCTAGACATATGTAGCTTGTACCGT-3' (SEQ.ID.NO:2)

PCR was performed using either TaqPlus™ precision polymerase (Stratagene) or rTth™
polymerase (Perkin Elmer) with the buffer systems provided by the manufacturers, 0.25 µM of
each primer, and 0.2 mM of each of the four (4) nucleotides. The cycle condition was 30
cycles of 94°C for 1 minute, 57°C for 1 minute and 72°C for 2 minutes. The 1.5 kb PCR
fragment was digested with Xho I and Xba I and subcloned into the Sal I-Xba I site of
pBluescript.
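
As an illustrative check, and not by way of limitation, the following Python sketch locates the Xho I and Xba I recognition sequences within the two primers set forth above. The recognition sequences (CTCGAG for Xho I and TCTAGA for Xba I) are the commonly published ones and are stated here as an assumption, since they are not recited in this disclosure; the function and variable names are hypothetical. The primer sequences are copied from SEQ.ID.NO:1 and SEQ.ID.NO:2.

```python
# Assumed recognition sequences (commonly published; not recited in the text).
RECOGNITION_SITES = {"Xho I": "CTCGAG", "Xba I": "TCTAGA"}

PRIMERS = {
    "SEQ.ID.NO:1": "GACCTCGAGGTTGCTTAAGACTGAAGCA",
    "SEQ.ID.NO:2": "ATTTCTAGACATATGTAGCTTGTACCGT",
}


def find_sites(sequence: str) -> dict[str, list[int]]:
    """Map each enzyme to the 0-based positions where its site starts."""
    hits: dict[str, list[int]] = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = [
            i for i in range(len(sequence) - len(site) + 1)
            if sequence.startswith(site, i)
        ]
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, primer in PRIMERS.items():
        print(name, find_sites(primer))
```

Each primer carries one of the two sites a few bases in from its 5' end, which is consistent with the Xho I and Xba I digestion of the PCR fragment described above.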

The derived cDNA clones were fully sequenced and found to correspond to
published sequences.

2. AP-1 cDNA

The cDNA containing a S310K mutation (AP-1 cDNA) in the third intracellular loop
of the human 5-HT2C receptor was constructed by replacing the Sty I restriction fragment
containing amino acid 310 with synthetic double stranded oligonucleotides encoding the
desired mutation. The sense strand sequence utilized had the following sequence:

5'-CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC-3'
(SEQ.ID.NO:3)

and the antisense strand sequence utilized had the following sequence:

5'-CAAGGACTTTCTTAGCTTTTCTTTCATTGTTGATAGCCTGCATGGTGCCC-3'
(SEQ.ID.NO:4).
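
For illustration only, the following Python sketch checks that the sense and antisense oligonucleotides set forth above pair with one another when annealed, leaving a short single-stranded overhang at each 5' end. The overhang length of four bases, and the statement that Sty I (recognition sequence CCWWGG, where W is A or T) leaves four-base 5' overhangs of the form CWWG, are assumptions based on commonly published enzyme data and are not recited in this disclosure. The function names are hypothetical.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


SENSE = "CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC"      # SEQ.ID.NO:3
ANTISENSE = "CAAGGACTTTCTTAGCTTTTCTTTCATTGTTGATAGCCTGCATGGTGCCC"  # SEQ.ID.NO:4


def annealed_duplex(sense: str, antisense: str, overhang: int = 4):
    """Return (sense overhang, paired region, antisense overhang).

    Raises ValueError if the strands do not pair once the assumed
    5' overhang is set aside on each strand.
    """
    paired = sense[overhang:]
    partner = reverse_complement(antisense[overhang:])
    if paired != partner:
        raise ValueError("strands are not complementary outside the overhangs")
    return sense[:overhang], paired, antisense[:overhang]


if __name__ == "__main__":
    left, paired, right = annealed_duplex(SENSE, ANTISENSE)
    print(left, len(paired), right)
```

Under these assumptions the two strands pair over 46 bases, and both overhangs fit the CWWG pattern expected of Sty I-compatible ends.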

B. Construction of constitutively active 5-HT2A receptor cDNA 

1. Endogenous Human 5-HT2A 

The cDNA encoding endogenous human 5-HT2A receptor was obtained by RT-PCR
using human brain poly-A+ RNA; a 5' primer from the 5' untranslated region with a Xho I
restriction site:

5'-GACCTCGAGTCCTTCTACACCTCATC-3' (SEQ.ID.NO:5)

and a 3' primer from the 3' untranslated region containing an Xba I site:

5'-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3' (SEQ.ID.NO:6).

PCR was performed using either TaqPlus™ precision polymerase (Stratagene) or rTth™
polymerase (Perkin Elmer) with the buffer systems provided by the manufacturers, 0.25 µM
of each primer, and 0.2 mM of each of the four (4) nucleotides. The cycle condition was 30
cycles of 94°C for 1 minute, 57°C for 1 minute and 72°C for 2 minutes. The 1.5 kb PCR
fragment was digested with Xba I and subcloned into the Eco RV-Xba I site of pBluescript.

The resulting cDNA clones were fully sequenced and found to encode two amino
acid changes from the published sequences. The first change is a T25N mutation in the
N-terminal extracellular domain and the second change is an H452Y mutation. These
mutations are likely to represent sequence polymorphisms rather than PCR errors since the
cDNA clones having the same two mutations were derived from two independent PCR
procedures using Taq polymerase from two different commercial sources (TaqPlus™
Stratagene and rTth™ Perkin Elmer).

2. Human 5-HT2A (C322K; AP-2)

The cDNA containing the point mutation C322K in the third intracellular loop was
constructed by using the Sph I restriction enzyme site, which encompasses amino acid 322.
For the PCR procedure, a primer containing the C322K mutation:

5'-CAAAGAAAGTACTGGGCATCGTCTTCTTCCT-3' (SEQ.ID.NO:7)

was used along with the primer from the 3' untranslated region set forth above as
SEQ.ID.NO:6. The resulting PCR fragment was then used to replace the 3' end of the wild
type 5-HT2A cDNA by the T4 polymerase blunted Sph I site. PCR was performed using
pfu polymerase (Stratagene) with the buffer system provided by the manufacturer and 10%
DMSO, 0.25 mM of each primer, and 0.5 mM of each of the 4 nucleotides. The cycle conditions
were 25 cycles of 94°C for 1 minute, 60°C for 1 minute and 72°C for 1 minute.

5. AP-3 cDNA

The human 5-HT2A cDNA with intracellular loop 3 (IC3) or IC3 and cytoplasmic 
tail replaced by the corresponding human 5-HT2C cDNA was constructed using PCR-based 
mutagenesis. 

(a) Replacement of IC3 Loop

The IC3 loop of human 5-HT2A cDNA was first replaced with the corresponding
human 5-HT2C cDNA. Two separate PCR procedures were performed to generate the two
fragments, Fragment A and Fragment B, that fuse the 5-HT2C IC3 loop to the
transmembrane 6 (TM6) of 5-HT2A. The 237 bp PCR fragment, Fragment A, containing
5-HT2C IC3 and the initial 13 bp of 5-HT2A TM6, was amplified by using the following
primers:

5'-CCGCTCGAGTACTGCGCCGACAAGCTTTGAT-3' (SEQ.ID.NO:8)
5'-CGATGCCCAGCACTTTCGAAGCTTTTCTTTCATTGTTG-3' (SEQ.ID.NO:9)

The template used was human 5-HT2C cDNA.

The 529 bp PCR fragment, Fragment B, containing the C-terminal 13 bp of IC3
from 5-HT2C and the C-terminal portion of 5-HT2A starting at the beginning of TM6, was
amplified by using the following primers:

5'-AAAAGCTTCGAAAGTGCTGGGCATCGTCTTCTTCCT-3' (SEQ.ID.NO:10)
5'-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3' (SEQ.ID.NO:11)

The template used was human 5-HT2A cDNA.

Second round PCR was performed using Fragment A and Fragment B as
co-templates with SEQ.ID.NO:8 and SEQ.ID.NO:11 (it is noted that the sequences for
SEQ.ID.NOS. 6 and 11 are the same) as primers. The resulting 740 bp PCR fragment,
Fragment C, contained the IC3 loop of human 5-HT2C fused to TM6 through the end of the
cytoplasmic tail of human 5-HT2A. PCR was performed using pfu™ polymerase
(Stratagene) with the buffer system provided by the manufacturer, 10% DMSO, 0.25 mM
of each primer, and 0.5 mM of each of the four (4) nucleotides. The cycle conditions were 25
cycles of 94°C for 1 minute, 57°C (1st round PCR) or 60°C (2nd round PCR) for 1 minute,
and 72°C for 1 minute (1st round PCR) or 90 seconds (2nd round PCR).
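
By way of illustration, and not limitation, the fusion of Fragment A and Fragment B in the second round PCR can be checked arithmetically. The Python sketch below computes the overlap between the 3' end of Fragment A (given by the reverse complement of the antisense primer, SEQ.ID.NO:9) and the 5' end of Fragment B (given by the sense primer, SEQ.ID.NO:10), then derives the expected length of the fused product from the fragment sizes stated above. The function names are hypothetical, and the primer sequences are copied from this disclosure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


FRAGMENT_A_ANTISENSE = "CGATGCCCAGCACTTTCGAAGCTTTTCTTTCATTGTTG"  # SEQ.ID.NO:9
FRAGMENT_B_SENSE = "AAAAGCTTCGAAAGTGCTGGGCATCGTCTTCTTCCT"        # SEQ.ID.NO:10


def overlap_length(upstream_end: str, downstream_start: str) -> int:
    """Length of the longest suffix of upstream_end equal to a prefix of downstream_start."""
    for k in range(min(len(upstream_end), len(downstream_start)), 0, -1):
        if upstream_end.endswith(downstream_start[:k]):
            return k
    return 0


def fused_length(len_a: int, len_b: int, overlap: int) -> int:
    """Length of the product when two fragments are joined through a shared overlap."""
    return len_a + len_b - overlap


if __name__ == "__main__":
    shared = overlap_length(
        reverse_complement(FRAGMENT_A_ANTISENSE), FRAGMENT_B_SENSE
    )
    print(shared, fused_length(237, 529, shared))
```

The 26 bp overlap found in this way corresponds to the 13 bp of TM6 carried by Fragment A together with the 13 bp of IC3 carried by Fragment B, and 237 + 529 - 26 gives the 740 bp stated for Fragment C.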
To generate a PCR fragment containing a fusion junction between the human
5-HT2A TM5 and the IC3 loop of 5-HT2C, four (4) primers were used. The two external
primers, derived from human 5-HT2A, had the following sequences:

5'-CGTGTCTCTCCTTACTTCA-3' (SEQ.ID.NO:12)

The other primer used was SEQ.ID.NO:6 (see note above regarding SEQ.ID.NOS. 6 and
11). The first internal primer utilized was an antisense strand containing the initial 13 bp of
IC3 of 5-HT2C followed by the terminal 23 bp derived from TM5 of 5-HT2A:

5 '-TCGGCGCAGTACTTTGATAGTTAGAAAGTAGGTGAT-3 ' (SEQ.ID.NO:13) 
5 The second internal primer was a sense strand containing the terminal 14 bp derived 

from TM5 of 5-HT2A followed by the initial 24 bp derived from IC3 of 5-HT2C: 

5 ' -TTCTA ACT ATC A A AGTACTGCGCCG AC AAGCTTTG ATG-3 ' 
(SEQ.ID.NO:14). 

PCR was performed using endogenous human 5-HT2A and a co-template, Fragment C, in a 50 ml
reaction volume containing 1X pfu buffer, 10% DMSO, 0.5 mM of each of the four (4)
nucleotides, 0.25 mM of each external primer (SEQ.ID.NOS. 11 and 12), 0.06 mM of each
internal primer (SEQ.ID.NOS. 13 and 14) and 1.9 units of pfu polymerase (Stratagene). The
cycle conditions were 25 cycles of 94 °C for 1 minute, 52 °C for 1 minute and 72 °C for 2
minutes and 10 seconds. The 1.3 kb PCR product was then gel purified and digested with
Pst I and Eco RI. The resulting 1 kb Pst I-Eco RI fragment was used to replace the
corresponding fragment in the endogenous human 5-HT2A sequence to generate the mutant
5-HT2A sequence encoding the IC3 loop of 5-HT2C.

(b) Replacement of the cytoplasmic tail 
To replace the cytoplasmic tail of 5-HT2A with that of 5-HT2C, PCR was performed using a
sense primer containing the C-terminal 22 bp of TM7 of endogenous human 5-HT2A followed by
the initial 21 bp of the cytoplasmic tail of endogenous human 5-HT2C:

5'-TTCAGCAGTCAACCCACTAGTCTATACTCTGTTCAACAAAATT-3' (SEQ.ID.NO:15)

The antisense primer was derived from the 3' untranslated region of endogenous human
5-HT2C:

5'-ATTTCTAGACATATGTAGCTTGTACCGT-3' (SEQ.ID.NO:16).

The resulting PCR fragment, Fragment D, contained the last 22 bp of endogenous human
5-HT2A TM7 fused to the cytoplasmic tail of endogenous human 5-HT2C. Second round PCR was
performed using Fragment D together with a co-template of endogenous human 5-HT2A that had
previously been digested with Acc I to avoid undesired amplification. The antisense primer
used was SEQ.ID.NO:16 (the sequences for SEQ.ID.NOS. 16 and 2 are the same) and the sense
primer used was derived from endogenous human 5-HT2A:
5'-ATCACCTACTTTCTAACTA-3' (SEQ.ID.NO:17).

PCR conditions were as set forth in Example 1B3.(a) for the first round PCR, except that
the annealing temperature was 48 °C and the extension time was 90 seconds. The resulting
710 bp PCR product was digested with Apa I and Xba I and used to replace the corresponding
Apa I-Xba I fragment of either (a) endogenous human 5-HT2A, or (b) 5-HT2A with 2C IC3 to
generate (a) endogenous human 5-HT2A with endogenous human 5-HT2C cytoplasmic tail and
(b) AP-3, respectively.

4. AP-4 cDNA 

This mutant was created by replacement of the region of endogenous human 5-HT2A from amino
acid 247, the middle of TM5 right after Pro246, to amino acid 337, the middle of TM6 just
before Pro338, by the corresponding region of AP-1 cDNA. For convenience, the junction in
TM5 is referred to as the "2A-2C junction," and the junction in TM6 is referred to as the
"2C-2A junction."

Three PCR fragments containing the desired hybrid junctions were generated. The 5'
fragment of 561 bp containing the 2A-2C junction in TM5 was generated by PCR using
endogenous human 5-HT2A as template, SEQ.ID.NO:12 as the sense primer, and an antisense
primer derived from 13 bp of 5-HT2C followed by 20 bp of 5-HT2A sequence:

5'-CCATAATCGTCAGGGGAATGAAAAATGACACAA-3' (SEQ.ID.NO:18)

The middle fragment of 323 bp contains endogenous human 5-HT2C sequence derived from the
middle of TM5 to the middle of TM6, flanked by 13 bp of 5-HT2A sequences from the 2A-2C
junction and the 2C-2A junction. This middle fragment was generated by using AP-1 cDNA as
a template, a sense primer containing 13 bp of 5-HT2A followed by 20 bp of 5-HT2C
sequences across the 2A-2C junction and having the sequence:

5'-ATTTTTCATTCCCCTGACGATTATGGTGATTAC-3' (SEQ.ID.NO:19);
and an antisense primer containing 13 bp of 5-HT2A followed by 20 bp of 5-HT2C sequences
across the 2C-2A junction and having the sequence:
5'-TGATGAAGAAAGGGCACCACATGATCAGAAACA-3' (SEQ.ID.NO:20).

The 3' fragment of 487 bp containing the 2C-2A junction was generated by PCR using
endogenous human 5-HT2A as a template and a sense primer having the following sequence
from the 2C-2A junction:
5'-GATCATGTGGTGCCCTTTCTTCATCACAAACAT-3' (SEQ.ID.NO:21)
and the antisense primer was SEQ.ID.NO:6 (see note above regarding SEQ.ID.NOS. 6 and 11).

Two second round PCR reactions were performed separately to link the 5' and middle
fragments (5'M PCR) and the middle and 3' fragments (M3' PCR). The 5'M PCR used the 5' and
middle PCR fragments described above as the co-template, SEQ.ID.NO:12 as the sense primer
and SEQ.ID.NO:20 as the antisense primer. The 5'M PCR procedure resulted in an 857 bp PCR
fragment.

The M3' PCR used the middle and 3' PCR fragments described above as the co-template,
SEQ.ID.NO:19 as the sense primer and SEQ.ID.NO:6 (see note above regarding SEQ.ID.NOS. 6
and 11) as the antisense primer, and generated a 784 bp amplification product. The final
round of PCR was performed using the 857 bp and 784 bp fragments from the second round PCR
as the co-template, and SEQ.ID.NO:12 and SEQ.ID.NO:6 (see note above regarding
SEQ.ID.NOS. 6 and 11) as the sense and the antisense primer, respectively. The 1.32 kb
amplification product from the final round of PCR was digested with Pst I and Eco RI. The
resulting 1 kb Pst I-Eco RI fragment was used to replace the corresponding fragment of the
endogenous human 5-HT2A to generate mutant 5-HT2A with 5-HT2C: C310K/IC3. The Apa I-Xba I
fragment of AP-3 was used to replace the corresponding fragment in mutant 5-HT2A with
5-HT2C: C310K/IC3 to generate AP-4.
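
The fragment sizes in this three-fragment assembly can be checked with simple arithmetic. The sketch below is illustrative only: the function name is hypothetical, and the 26 bp junction overlap is an assumption inferred from the 13 bp flanks described on each side of a junction, not a figure stated in the text.

```python
def joined_length(left_bp: int, right_bp: int, shared_bp: int) -> int:
    """Length of two fragments joined through a shared region that is counted once."""
    return left_bp + right_bp - shared_bp


# Assumption: each junction is shared over 13 bp of 5-HT2A plus 13 bp of 5-HT2C.
JUNCTION_OVERLAP_BP = 26

five_prime_middle = joined_length(561, 323, JUNCTION_OVERLAP_BP)   # 5'M PCR
middle_three_prime = joined_length(323, 487, JUNCTION_OVERLAP_BP)  # M3' PCR

# The two second-round products share the whole 323 bp middle fragment.
final_product = joined_length(857, 784, 323)

print(five_prime_middle, middle_three_prime, final_product)
```

With these inputs the M3' product comes to 784 bp and the final product to 1318 bp (about 1.32 kb), both matching the text. The 5'M product comes to 858 bp, one base pair more than the stated 857 bp; the source does not explain the difference.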

Example 2

Receptor Expression

A. pCMV

Although a variety of expression vectors are available to those in the art, for purposes
of utilization for both the endogenous and non-endogenous receptors discussed herein, it
is most preferred that the vector utilized be pCMV. This vector was deposited with the
American Type Culture Collection (ATCC) on October 13, 1998 (10801 University Blvd.,
Manassas, VA 20110-2209 USA) under the provisions of the Budapest Treaty for the
International Recognition of the Deposit of Microorganisms for the Purpose of Patent
Procedure. The DNA was tested by the ATCC and determined to be viable. The ATCC has
assigned the following deposit number to pCMV: ATCC #203351. See Figure 8.

B. Transfection procedure
For the generic assay ([35S]GTPγS; Example 3) and the antagonist binding assay
(mesulergine; Example 4), transfection of COS-7 or 293T cells was accomplished using the
following protocol.

On day one, 5 x 10^6 COS-7 cells or 1 x 10^7 293T cells per 150 mm plate were plated out.
On day two, two reaction tubes were prepared (the proportions to follow for each tube are
per plate): tube A was prepared by mixing 20 µg DNA (e.g., pCMV vector; pCMV vector AP-1
cDNA, etc.) in 1.2 ml serum free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared
by mixing 120 µl lipofectamine (Gibco BRL) in 1.2 ml serum free DMEM. Tubes A and B were
then admixed by inversions (several times), followed by incubation at room temperature for
30-45 min. The admixture is referred to as the "transfection mixture". Plated COS-7 cells
were washed with 1X PBS, followed by addition of 10 ml serum free DMEM. 2.4 ml of the
transfection mixture was then added to the cells, followed by incubation for 4 hrs at
37 °C/5% CO2. The transfection mixture was then removed by aspiration, followed by the
addition of 25 ml of DMEM/10% Fetal Bovine Serum. Cells were then incubated at 37 °C/5%
CO2. After 72 hr incubation, cells were harvested and utilized for analysis.
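
Because the proportions above are stated per plate, preparing several plates is a matter of linear scaling. The following is a minimal sketch of that bookkeeping; the class and function names are hypothetical, and the small volume contributed by the lipofectamine itself is ignored, as it is in the 2.4 ml figure given in the text.

```python
from dataclasses import dataclass


@dataclass
class TransfectionMix:
    dna_ug: float            # DNA added to tube A
    tube_a_dmem_ml: float    # serum free DMEM in tube A
    lipofectamine_ul: float  # lipofectamine added to tube B
    tube_b_dmem_ml: float    # serum free DMEM in tube B

    @property
    def mixture_ml(self) -> float:
        """Combined volume of tubes A and B (lipofectamine volume ignored)."""
        return self.tube_a_dmem_ml + self.tube_b_dmem_ml


def scale_transfection(plates: int) -> TransfectionMix:
    """Scale the per-plate proportions (20 ug DNA, 120 ul lipofectamine, 1.2 ml DMEM per tube)."""
    if plates < 1:
        raise ValueError("at least one plate is required")
    return TransfectionMix(
        dna_ug=20.0 * plates,
        tube_a_dmem_ml=1.2 * plates,
        lipofectamine_ul=120.0 * plates,
        tube_b_dmem_ml=1.2 * plates,
    )


mix = scale_transfection(4)
print(mix.dna_ug, mix.lipofectamine_ul, round(mix.mixture_ml, 2))
```

For four plates this gives 80 µg DNA and 480 µl lipofectamine in a combined 9.6 ml of transfection mixture, that is, 2.4 ml per plate.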

Example 3 

GTP Membrane Binding Scintillation Proximity Assay 

The advantages of using [35S]GTPγS binding to measure constitutive activation are that:
(a) [35S]GTPγS binding is generically applicable to all G protein-coupled receptors; and
(b) [35S]GTPγS binding is proximal at the membrane surface, thereby making it less likely
to pick up molecules which affect the intracellular cascade. The assay utilizes the
ability of G protein-coupled receptors to stimulate [35S]GTPγS binding to membranes
expressing the relevant receptors. Therefore, the assay may be used to directly screen
compounds at the disclosed serotonin receptors.

Figure 9 demonstrates the utility of a scintillation proximity assay to monitor the
binding of [35S]GTPγS to membranes expressing the endogenous human 5-HT2C receptor
expressed in COS cells. In brief, the assay was incubated in 20 mM HEPES, pH 7.4, binding
buffer with 0.3 nM [35S]GTPγS and 12.5 µg membrane protein and 1 µM GDP for 30 minutes.
Wheatgerm agglutinin beads (25 µl; Amersham) were then added and the mixture was incubated
for another 30 minutes at room temperature. The tubes were then centrifuged at 1500 x g
for 5 minutes at room temperature and then counted in a scintillation counter. As shown in
Figure 9, serotonin, which as the endogenous ligand activates the 5-HT2C receptor,
stimulated [35S]GTPγS binding to the membranes in a concentration dependent manner. The
stimulated binding was completely inhibited by 30 µM mianserin, a compound considered as a
classical 5-HT2C antagonist, but also known as a 5-HT2C inverse agonist.

Although this assay measures agonist-induced binding of [35S]GTPγS to membranes and can be
routinely used to measure constitutive activity of receptors, the present cost of
wheatgerm agglutinin beads may be prohibitive. A less costly but equally applicable
alternative also meets the needs of large-scale screening. Flash plates and Wallac™
scintistrips may be used to format a high throughput [35S]GTPγS binding assay. This
technique allows one to monitor the tritiated ligand binding to the receptor while
simultaneously monitoring the efficacy via [35S]GTPγS binding. This is possible because
the Wallac™ beta counter can switch energy windows to analyze both tritium and
35S-labeled probes.

Also, this assay may be used for detecting other types of membrane activation events that
result in receptor activation. For example, the assay may be used to monitor 32P
phosphorylation of a variety of receptors (including G protein-coupled and tyrosine kinase
receptors). When the membranes are centrifuged to the bottom of the well, the bound
[35S]GTPγS or the 32P-phosphorylated receptor will activate the scintillant coated on the
wells. Use of Scinti® strips (Wallac™) demonstrates this principle. Additionally, this
assay may be used for measuring ligand binding to receptors using radiolabeled ligands. In
a similar manner, the radiolabeled bound ligand is centrifuged to the bottom of the well
and activates the scintillant. The [35S]GTPγS assay results parallel the results obtained
in traditional second messenger assays of receptors.

As shown in Figure 10, serotonin stimulates the binding of [35S]GTPγS to the endogenous
human 5-HT2C receptor, while mianserin inhibits this response. Furthermore, mianserin acts
as a partial inverse agonist by inhibiting the basal constitutive binding of [35S]GTPγS to
membranes expressing the endogenous human 5-HT2C receptor. As expected, there is no
agonist response in the absence of GDP since there is no GDP present to exchange for
[35S]GTPγS. Not only does this assay system demonstrate the response of the native 5-HT2C
receptor, but it also measures the constitutive activation of other receptors.

Figure 11A and Figure 11B demonstrate the enhanced binding of [35S]GTPγS to membranes
prepared from 293T cells expressing the control vector alone, the native human 5-HT2C
receptor or the AP-1 receptor. The total protein concentration used in the assay affects
the total amount of [35S]GTPγS binding for each receptor. The c.p.m. differential between
the CMV transfected and the constitutively active mutant receptor increased from
approximately 1000 c.p.m. at 10 µg/well to approximately 6000-8000 c.p.m. at 75 µg/well
protein concentration, as shown in Figure 11.

The AP-1 receptor showed the highest level of constitutive activation followed by the wild
type receptor, which also showed enhanced [35S]GTPγS binding above basal. This is
consistent with the ability of the endogenous human 5-HT2C receptor to accumulate
intracellular IP3 in the absence of 5HT stimulation (Example 5) and is also consistent
with published data claiming that the endogenous human 5-HT2C receptor has a high natural
basal activity. Therefore, the AP-1 receptor demonstrates that constitutive activity may
be measured by proximal [35S]GTPγS binding events at the membrane interface.

Example 4

SEROTONIN RECEPTOR AGONIST/ANTAGONIST COMPETITIVE BINDING ASSAY

Membranes were prepared from transfected COS-7 cells (see Example 2) by homogenization in
20 mM HEPES and 10 mM EDTA, pH 7.4 and centrifuged at 49,000 x g for 15 min. The pellet
was resuspended in 20 mM HEPES and 0.1 mM EDTA, pH 7.4, homogenized for 10 sec. using a
polytron homogenizer (Brinkman) at 5000 rpm and centrifuged at 49,000 x g for 15 min. The
final pellet was resuspended in 20 mM HEPES and 10 mM MgCl2, pH 7.4, and homogenized for
10 sec. using a polytron homogenizer (Brinkman) at 5000 rpm.

Assays were performed in triplicate 200 µl volumes in 96 well plates. Assay buffer (20 mM
HEPES and 10 mM MgCl2, pH 7.4) was used to dilute membranes, 3H-LSD, 3H-mesulergine,
serotonin (used to define non-specific for LSD binding) and mianserin (used to define
non-specific for mesulergine binding). Final assay concentrations consisted of 1 nM 3H-LSD
or 1 nM 3H-mesulergine, 50 µg membrane protein and 100 µM serotonin or mianserin. LSD
assays were incubated for 1 hr at 37 °C, while mesulergine assays were incubated for 1 hr
at room temperature. Assays were terminated by rapid filtration onto Wallac Filtermat
Type B with ice cold binding buffer using a Skatron cell harvester. The radioactivity was
determined in a Wallac 1205 BetaPlate counter.

Example 5

Intracellular IP3 Accumulation Assay

For the IP3 accumulation assay, a transfection protocol different from the protocol set
forth in Example 2 was utilized. In the following example, the protocols used for days 1-3
were slightly different for the data generated for Figures 12 and 14 and for Figures 13
and 15; the protocol for day 4 was the same for all conditions.

A. COS-7 and 293 Cells 

On day one, COS-7 cells or 293 cells were plated onto 24 well plates, usually 1 x 10^5
cells/well or 2 x 10^5 cells/well, respectively. On day two, the cells were transfected by
first mixing 0.25 µg DNA (see Example 2) in 50 µl serum-free DMEM/well and then 2 µl
lipofectamine in 50 µl serum-free DMEM/well. The solutions ("transfection media") were
gently mixed and incubated for 15-30 minutes at room temperature. The cells were washed
with 0.5 ml PBS and then 400 µl of serum free media was mixed with the transfection media
and added to the cells. The cells were then incubated for 3-4 hours at 37 °C/5% CO2. Then
the transfection media was removed and replaced with 1 ml/well of regular growth media. On
day 3, the media was removed and the cells were washed with 0.5 ml PBS. Then 0.5 ml
inositol-free/serum-free media (GIBCO BRL) was added to each well with 0.25 µCi of
3H-myo-inositol/well and the cells were incubated for 16-18 hours overnight at
37 °C/5% CO2. Protocol A.

B. 293 Cells 

On day one, 1 x 10^7 293 cells per 150 mm plate were plated out. On day two, two reaction
tubes were prepared (the proportions to follow for each tube are per plate): tube A was
prepared by mixing 20 µg DNA (e.g., pCMV vector; pCMV vector AP-1 cDNA, etc.) in 1.2 ml
serum free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared by mixing 120 µl
lipofectamine (Gibco BRL) in 1.2 ml serum free DMEM. Tubes A and B were then admixed by
inversions (several times), followed by incubation at room temperature for 30-45 min. The
admixture is referred to as the "transfection mixture". Plated 293 cells were washed with
1X PBS, followed by addition of 10 ml serum free DMEM. 2.4 ml of the transfection mixture
was then added to the cells, followed by incubation for 4 hrs at 37 °C/5% CO2. On day 3,
cells were trypsinized and counted, followed by plating of 1 x 10^6 cells/well (poly
D-lysine treated 12-well plates). Cells were permitted to adhere to the wells, followed by
one wash with 1X PBS. Thereafter, 0.5 µCi 3H-inositol in 1 ml inositol-free DMEM was added
per well. Protocol B.

On day 4, the cells were washed with 0.5 ml PBS and then 0.45 ml of assay medium was added
containing inositol-free/serum free media, 10 µM pargyline, 10 mM lithium chloride, or
0.4 ml of assay medium and 50 µl of 10X ketanserin (ket) to a final concentration of
10 µM. The cells were then incubated for 30 minutes at 37 °C. Then the cells were washed
with 0.5 ml PBS and 200 µl of fresh/ice cold stop solution (1 M KOH; 18 mM Na-borate;
3.8 mM EDTA) was added/well. The solution was kept on ice for 5-10 minutes or until the
cells were lysed and then neutralized by 200 µl of fresh/ice cold neutralization solution
(7.5% HCl). The lysate was then transferred into 1.5 ml micro-centrifuge tubes and 1 ml of
chloroform/methanol (1:2) was added/tube. The solution was vortexed for 15 seconds and the
upper phase was applied to a Biorad AG1-X8 anion exchange resin (100-200 mesh). The resin
was washed with water and 0.9 ml of the upper phase was loaded onto the column. The column
was washed with 10 ml of 5 mM myo-inositol and 10 ml of 5 mM Na-borate/60 mM Na-formate.
The inositol trisphosphates were eluted into scintillation vials containing 10 ml of
scintillation cocktail with 2 ml of 0.1 M formic acid/1 M ammonium formate. The columns
were regenerated by washing with 10 ml of 0.1 M formic acid/3 M ammonium formate and
rinsed twice with ddH2O and stored at room temperature in water. Results are discussed
below.

Figure 12 is an illustration of IP3 production from the human 5-HT2A receptor which was
mutated using the same point mutation as set forth in Casey, which rendered the rat
receptor constitutively active. The results represented in Figure 12 support the position
that when the point mutation shown to activate the rat receptor is introduced into the
human receptor, too little activation of the receptor is obtained to allow for appropriate
screening of candidate compounds, with the response being only moderately above that of
the endogenous human 5-HT2A receptor. Generally, a response of at least 2X above that of
the endogenous response is preferred.

Figure 13 provides an illustration comparing IP3 production from endogenous 5-HT2A
receptor and the AP-4 mutation. The results illustrated in Figure 13 support the position
that when the novel mutation disclosed herein is utilized, a robust response of
constitutive IP3 accumulation is obtained (e.g., over 2X that of the endogenous receptor).

Figure 14 provides an illustration of IP3 production from AP-3. The results illustrated in
Figure 14 support the position that when the novel mutation disclosed herein is utilized,
a robust response of constitutive IP3 accumulation is obtained.

Figure 15 provides bar-graph comparisons of IP3 accumulation between endogenous human
5-HT2C receptor and AP-1. Note that the endogenous receptor has a high degree of natural
constitutive activity relative to the control CMV transfected cells (i.e., the endogenous
receptor appears to be constitutively activated).

Example 6

Screening of Compounds Known to have 5-HT2C Antagonist Activity Against
Non-Endogenous, Constitutively Activated
Human Serotonin Receptor: AP-1

A final concentration of 12.5 µg membranes prepared from COS7 cells (see Example 2)
transiently expressing constitutively active mutant human 5HT2C receptor AP-1 were
incubated with binding buffer (20 mM HEPES, pH 7.4, 100 mM NaCl, 20 mM MgCl2·6H2O, 0.2%
saponin, and 0.2 mM ascorbate), GDP (1 µM) and compound in a 96-well plate format for a
period of 60 minutes at ambient room temperature. Plates were then centrifuged at 4,000
rpm for 15 minutes followed by aspiration of the reaction mixture and counting for 1
minute in a Wallac™ MicroBeta plate scintillation counter. A series of compounds reported
to possess 5HT2C antagonist activity were determined to be active in the [35S]GTPγS
binding assay using AP-1. IC50 determinations were made for these commercially available
compounds (RBI, Natick, MA). Results are summarized in Table 3. For each determination,
eight concentrations of test compounds were tested in triplicate. The negative control in
these experiments consisted of AP-1 receptor without test compound addition, and the
positive control consisted of 12.5 µg/well of COS7 cell membranes expressing the CMV
promoter without expressed AP-1 receptor.
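
The text does not say how IC50 values were derived from the eight-concentration curves. As one simple possibility, offered only as an assumption, the sketch below estimates an IC50 by log-linear interpolation between the two tested concentrations that bracket 50% of the control response. The function name and the example curve are hypothetical; the numbers are illustrative and are not data from this document. A full analysis would more commonly fit a sigmoidal dose-response model.

```python
import math


def ic50_by_interpolation(concs_nM, percent_of_control):
    """Estimate IC50 (nM) by interpolating on log10(concentration) across the 50% crossing."""
    points = sorted(zip(concs_nM, percent_of_control))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 50.0 >= r_hi and r_lo != r_hi:
            fraction = (r_lo - 50.0) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    raise ValueError("response does not cross 50% within the tested range")


# Hypothetical eight-point curve (mean of triplicates, as percent of the no-compound control).
concs = [0.01, 0.1, 1, 10, 100, 1000, 10000, 100000]
response = [100, 98, 90, 60, 40, 10, 2, 0]

estimate = ic50_by_interpolation(concs, response)
print(round(estimate, 1))
```

Interpolating on the logarithm of concentration, rather than on concentration itself, reflects the roughly log-linear shape of a dose-response curve near its midpoint.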



TABLE 3

Test Compound         Known Pharmacology        IC50 (nM) in GTP-γ-[35S] Assay
Metergoline           5HT2/1C antagonist        32.0
Mesulergine           5HT2/1C antagonist        21.2
Methysergide          5HT2/1C antagonist        6.1
Methiothepin          5HT1 antagonist           20.4
Normethylclozapine    5HT2/1C antagonist        21.4
Fluoxetine            5HT reuptake inhibitor    114.0
Ritanserin            5HT2/1C antagonist        19.4

The IC50 results confirm that the seven tested compounds showed antagonist activity at the
AP-1 receptor.

Example 7

SCREENING OF CANDIDATE COMPOUNDS AGAINST NON-ENDOGENOUS,
CONSTITUTIVELY ACTIVATED HUMAN SEROTONIN RECEPTORS: AP-1

Approximately 5,500 candidate compounds (Tripos, Inc., St. Louis, MO) were screened using
the assay protocol of Example 3 (with AP-1 mutant receptor) for identification as inverse
agonists against the receptor; for this assay, an arbitrary cut-off of at least 50%
inhibition was established for identification of inverse agonists. Approximately 120 of
these compounds evidenced at least 50% inhibition of [35S]GTPγS binding at 10 µM candidate
compound (data not shown).
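
The 50% cut-off can be applied as a simple calculation on raw counts. The sketch below assumes the control definitions given in Example 6 (negative control: AP-1 membranes without compound; positive control: membranes without AP-1 receptor) and normalizes each well to the window between them. The function names and the c.p.m. values in the usage lines are hypothetical and are not data from this document.

```python
def percent_inhibition(signal_cpm: float, negative_cpm: float, positive_cpm: float) -> float:
    """Inhibition of receptor-mediated binding, as a percentage of the control window.

    negative_cpm: AP-1 membranes with no compound (uninhibited signal).
    positive_cpm: membranes lacking AP-1 (signal expected at full inhibition).
    """
    window = negative_cpm - positive_cpm
    if window <= 0:
        raise ValueError("negative control must exceed positive control")
    return 100.0 * (negative_cpm - signal_cpm) / window


def is_hit(signal_cpm: float, negative_cpm: float, positive_cpm: float,
           cutoff: float = 50.0) -> bool:
    """Apply the arbitrary cut-off of at least 50% inhibition described in the text."""
    return percent_inhibition(signal_cpm, negative_cpm, positive_cpm) >= cutoff


# Hypothetical counts for two wells on the same plate.
print(percent_inhibition(4400, 8000, 2000), is_hit(4400, 8000, 2000))
print(percent_inhibition(6500, 8000, 2000), is_hit(6500, 8000, 2000))
```

For scale, roughly 120 hits out of about 5,500 compounds corresponds to a hit rate of a little over 2%.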

Example 8

SCREENING OF SELECTED COMPOUNDS TO CONFIRM RECEPTOR
BINDING: AP-1

The candidate compounds identified from Example 7 were then screened using the assay
protocol of Example 4 (mesulergine), using the AP-1 mutant receptor. IC50 (nM) values were
determined; five of the nearly 120 compounds of Example 7 were determined to have potent
binding affinity for the receptor. Results are summarized in Table 4.

Table 4

Candidate Compound    IC50 (nM) in Mesulergine Assay
102461                205.0
102788                46.5
100341                209.0
100431                147.0
103487                1,810.0

Example 9a
GENERAL SCREENING PARADIGM:
SELECTION OF PRE-CLINICAL CANDIDATE LEADS

The "primary" screen designed to directly identify human 5HT2A/5HT2C receptor inverse
agonists consisted of a membrane-based GTPγS binding assay utilizing membranes prepared
from COS7 cells transiently transfected with AP-1 human receptor. Candidate compounds
(10 µM final assay concentration) directly identified as inhibiting receptor-mediated
increases in GTPγS binding by greater than 50-75% (arbitrary cut-off value) were
considered active "hits". Primary assay hits were then re-tested in the same assay to
reconfirm their inverse agonist activity. If primary assay hits were reconfirmed active
(50% or greater inhibition), and therefore directly identified as, e.g., an inverse
agonist, one of two approaches was available: (a) so-called "directed libraries" could be
created, i.e., additional candidate compounds were synthesized based upon the structures
of the reconfirmed hits (geared towards, e.g., improvement in the characteristics of the
compounds) whereby the directed library compounds were then evaluated for the ability to
compete for radioligand binding to both mutant 5HT2C (AP-1) and endogenous 5HT2A
receptors, or (b) the reconfirmed hits were then evaluated for the ability to compete for
radioligand binding to both mutant 5HT2C (AP-1) and endogenous 5HT2A receptors. Thus, when
approach (a) was used, because these directed library candidate compounds were based upon
the structures of compounds that were directly identified from the membrane-based GTPγS
binding assay, the directed library compounds were not re-tested in the membrane-based
GTPγS binding assay but rather were then confirmed via the radioligand binding analysis.
The radioligand binding analysis tests were initially performed at 10 µM test compound in
triplicate and if the compound inhibited radiolabeled binding by 50% or more, the analysis
was followed by eight concentration competition curves to determine Ki values. The last
step in secondary assay evaluation was to determine if test compounds were capable of
inhibiting AP-3 receptor-mediated accumulation of inositol phosphates (e.g., IP3). This
final assay confirms that the directly identified compounds retained inverse agonist
properties.
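
The text does not state how Ki values were obtained from the competition curves. One widely used conversion for competitive binding, given here only as an assumption about what such an analysis could look like, is the Cheng-Prusoff relation:

```latex
K_i = \frac{\mathrm{IC}_{50}}{1 + \dfrac{[L]}{K_d}}
```

Here $[L]$ is the radioligand concentration in the assay (1 nM in Examples 9c(1) and 9c(2)) and $K_d$ is the dissociation constant of the radioligand for the receptor, a value that is not given in this document. When $[L]$ is well below $K_d$, $K_i$ approaches the measured IC50.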

Example 9b

CONSTITUTIVELY ACTIVATED HUMAN 5HT2C RECEPTOR (AP-1)
MEDIATED FACILITATION OF GTPγS BINDING TO COS7 MEMBRANES

This protocol is substantially the same as set forth above in Example 6.

Primary screening assays measuring GTPγS binding to membranes prepared from COS7 cells
transiently transfected with human mutated 5HT2C receptor (AP-1) were used to directly
identify inverse agonists in screening libraries (Tripos, Inc.). Candidate compound
screens were performed in a total assay volume of 200 µl using scintillant-coated Wallac
Scintistrip™ plates. The primary assay comprised the following chemicals (at indicated
final assay concentrations): 20 mM HEPES, pH 7.4, 100 mM NaCl, 20 mM MgCl2, 0.2% saponin,
0.2 mM ascorbic acid, 1 µM GDP, 0.3 nM [35S]GTPγS, and 12.5 µg of the above defined
membranes. Incubations were performed for 60 minutes at ambient room temperature. The
binding assay incubation was terminated by centrifugation of assay plates at 4,000 rpm for
15 minutes, followed by rapid aspiration of the reaction mixture and counting in a Wallac
MicroBeta™ scintillation counter.

Primary screening of candidate compounds initially involved testing of 72 test compounds
per assay plate (96-well plates were utilized), at a final assay concentration of 10 µM
candidate compound, in single replicates. A total of sixteen wells of each plate were
dedicated to an eight concentration clozapine (a confirmed 5HT2C/2A inverse agonist) dose
response curve (duplicate determinations at each concentration). Finally, a total of five
assay wells of each plate were dedicated to define the negative control (AP-1 receptor
expressing membranes without addition of candidate compounds) and three wells from each
plate to define the positive control (membranes without AP-1 receptor).

Reconfirmation experiments involved re-testing candidate compounds in the same assay
described above, except that candidate compounds were evaluated in triplicate, thus
allowing evaluation of 24 compounds per 96-well assay plate. Similar to the primary assay
plates, an eight concentration clozapine dose response curve (duplicate determinations at
each concentration) and the same negative and positive control wells were also included
within each 96-well plate.
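
The well allocations described above can be tallied to confirm that each layout fills a 96-well plate exactly. This is a minimal sketch with hypothetical names; all counts are taken from the text.

```python
WELLS_PER_PLATE = 96


def wells_used(n_compounds: int, replicates: int,
               reference_concentrations: int = 8, reference_replicates: int = 2,
               negative_wells: int = 5, positive_wells: int = 3) -> int:
    """Total wells consumed by test compounds, the clozapine curve, and the controls."""
    return (n_compounds * replicates
            + reference_concentrations * reference_replicates
            + negative_wells
            + positive_wells)


primary = wells_used(n_compounds=72, replicates=1)         # single replicates
reconfirmation = wells_used(n_compounds=24, replicates=3)  # triplicates
print(primary, reconfirmation)
```

Both layouts come to 96 wells: 72 compound wells, 16 clozapine wells, 5 negative control wells and 3 positive control wells.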

Example 9c(1)

COMPETITION STUDIES
MUTATED HUMAN 5HT2C RECEPTOR (AP-1)

Radioligand binding competition experiments were performed in a total assay volume of
200 µl using standard 96-well microtiter plates. The final assay ingredients consisted of
assay buffer (20 mM HEPES and 10 mM MgCl2), 1 nM [3H]mesulergine, and 50 µg of membranes
(COS7 with AP-1 as defined above). Nonspecific [3H]mesulergine binding was defined in the
presence of 100 µM mianserin. Incubations were performed for 1 hour at 37 °C. Receptor
bound radioligand was resolved from free radioligand by rapid filtration of the assay
mixture over a Wallac Filtermat™ Type B filter, followed by washing with ice-cold assay
buffer using a Skatron™ cell harvester. Radioactivity was counted using a Wallac 1205
BetaPlate™ counter. Each assay plate contained five negative control wells (membranes
expressing receptor and no candidate compound addition) and three positive control wells
(each containing 100 µM mianserin). For one concentration tests, candidate compounds were
diluted into assay buffer and screened at a final concentration of 10 µM, in triplicate.
For IC50 determinations, candidate compounds were diluted in assay buffer and eight
different concentrations were evaluated, in triplicate. A total of 16 wells were
designated for an eight concentration mianserin dose response curve evaluation for both
assays.
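
The control wells described above define the window of specific binding: the negative control wells give total binding and the wells containing 100 µM mianserin give nonspecific binding. The following minimal sketch shows how triplicate counts for a candidate compound could be expressed relative to that window. The function names and all c.p.m. values are hypothetical and are not data from this document.

```python
from statistics import mean


def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding = mean total binding minus mean nonspecific binding."""
    return mean(total_cpm) - mean(nonspecific_cpm)


def percent_specific_bound(compound_cpm, total_cpm, nonspecific_cpm):
    """Radioligand still specifically bound in the presence of a compound, in percent."""
    window = specific_binding(total_cpm, nonspecific_cpm)
    if window <= 0:
        raise ValueError("total binding must exceed nonspecific binding")
    return 100.0 * (mean(compound_cpm) - mean(nonspecific_cpm)) / window


# Hypothetical plate: five total-binding wells, three nonspecific wells, one triplicate.
total = [5000, 5100, 4900, 5050, 4950]
nonspecific = [500, 520, 480]
compound = [2700, 2750, 2800]

remaining = percent_specific_bound(compound, total, nonspecific)
print(remaining, 100.0 - remaining)
```

A compound leaving 50% or less of the specific binding at 10 µM would meet the criterion in Example 9a for proceeding to an eight concentration competition curve.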

Example 9c(2)

COMPETITION STUDIES
WILD TYPE HUMAN 5HT2A RECEPTOR

Radioligand binding competition experiments were performed in a total assay volume of
200 µl using standard 96-well microtiter plates. The final assay ingredients comprised
assay buffer (20 mM HEPES and 10 mM MgCl2), 1 nM [3H]LSD, and 50 µg of the above-defined
membranes (COS7 with AP-1). Nonspecific [3H]LSD binding was defined in the presence of
100 µM serotonin. Incubations were performed for 1 hour at 37 °C. Receptor bound
radioligand was resolved from free radioligand by rapid filtration of the assay mixture
over a Wallac Filtermat™ Type B filter, followed by washing with ice-cold assay buffer
using a Skatron™ cell harvester. Radioactivity was counted using a Wallac 1205 BetaPlate™
counter. Each assay plate contained five negative control wells (membranes expressing
receptor and no candidate compound addition) and three positive control wells (containing
100 µM mianserin). For one concentration tests, candidate compounds were diluted into
assay buffer and screened at a final concentration of 10 µM in triplicate. For IC50
determinations, candidate compounds were diluted in assay buffer and eight different
concentrations were evaluated in triplicate. A total of 16 wells were designated for an
eight concentration serotonin dose response curve evaluation for both assays.

Example 9d 

20 RECEPTOR-MEDIATED INOSITOL PHOSPHATE ACCUMULATION 

Candidate compounds identified in the assays of Examples 9a-9c were then evaluated 
for inositol phosphate accumulation, following the protocol of Example 5 (COS7 cells 
expressing human mutated 5HT2A receptor, AP-3), modified as follows: tube A was 
prepared by mixing 16 µg DNA (e.g., pCMV vector; pCMV vector AP-1 cDNA, etc.) in 1.0 ml 
serum-free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared by mixing 60 µl 
lipofectamine (Gibco BRL) in 1.0 ml serum-free DMEM. Tubes A and B were then admixed 
by inversion (several times), followed by incubation at room temperature for 30 min. The 
admixture is referred to as the "transfection mixture". Plated 293 cells were washed with 10 
ml serum-free DMEM, followed by addition of 11 ml serum-free DMEM. 2.0 ml of the 
transfection mixture was then added to the cells, followed by incubation for 5 hrs at 37°C/5% 
CO2. On day 3, cells were trypsinized and counted, followed by plating of 1 x 10^6 cells/well 
(12-well plates). Cells were permitted to adhere to the wells for 8 hrs, followed by one wash 
with 1x PBS. Thereafter, 0.5 µCi 3H-inositol in 1 ml inositol-free DMEM was added per well. 

On day 4, the cells were washed with 1.5 ml PBS and then 0.9 ml of assay medium 
(inositol-free/serum-free medium containing 10 µM pargyline and 10 mM lithium 
chloride) was added for 5 min at 37°C/5% CO2, followed by addition of 100 µl of candidate 
compound diluted in the same medium. The cells were then incubated for 120 minutes at 37°C. Then 
the cells were washed with 1.5 ml PBS and 200 µl of fresh, ice-cold stop solution (1 M KOH; 
18 mM Na-borate; 3.8 mM EDTA) was added per well. The solution was kept on ice for 5-10 
minutes or until the cells were lysed, and was then neutralized with 200 µl of fresh, ice-cold 
neutralization solution (7.5% HCl). The lysate was then transferred into 1.5 ml 
microcentrifuge tubes and 1 ml of chloroform/methanol (1:2) was added per tube. The solution was 
vortexed for 15 seconds and the upper phase was applied to a Biorad AG1-X8 anion 
exchange resin (100-200 mesh). The resin was washed with water and 0.9 ml of the upper 
phase was loaded onto the column. The column was washed with 10 ml of 5 mM 
myo-inositol and 10 ml of 5 mM Na-borate/60 mM Na-formate. The inositol trisphosphates were 
eluted into scintillation vials containing 10 ml of scintillation cocktail with 2 ml of 0.1 M 
formic acid/1 M ammonium formate. The columns were regenerated by washing with 10 
ml of 0.1 M formic acid/3 M ammonium formate, rinsed twice with ddH2O, and stored at 
room temperature in water. 
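
Because 100 µl of candidate compound is added to 0.9 ml of assay medium, the compound is diluted on addition. The following is a minimal sketch of that arithmetic; the function names are hypothetical and the 10 µM target is used only as an illustration.

```python
# Dilution arithmetic for adding a small volume of compound to assay medium.
# Volumes are in microlitres so the arithmetic stays exact for this example.

def final_concentration(stock_conc, added_ul, medium_ul):
    """Concentration in the well after `added_ul` of stock joins `medium_ul`."""
    return stock_conc * added_ul / (added_ul + medium_ul)


def required_stock(final_conc, added_ul, medium_ul):
    """Stock concentration needed to reach `final_conc` in the well."""
    return final_conc * (added_ul + medium_ul) / added_ul


# 100 µl added to 900 µl is a ten-fold dilution.
print(final_concentration(100.0, 100, 900))  # µM in the well
print(required_stock(10.0, 100, 900))        # µM stock for a 10 µM final
```

The same two functions apply to any addition step where the added volume and the starting volume are known.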

Following this round of assaying, candidate compounds having an IC50 value of less 
than 10 µM were considered as potential leads for the development of pharmaceutical 
compositions. 
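
The text does not say how IC50 values were derived from the eight-concentration data. As one simple possibility, the sketch below estimates an IC50 by log-linear interpolation between the two concentrations that bracket 50% of control, then applies the 10 µM cut-off. This is a stand-in for a full curve fit (a four-parameter logistic fit is more usual); the data values and function names are hypothetical.

```python
import math


def ic50_by_interpolation(concentrations_um, percent_of_control):
    """Estimate IC50 (µM) by interpolating on a log-concentration axis.

    `concentrations_um` must be ascending; `percent_of_control` holds the
    matching responses. Returns None if the curve never crosses 50%.
    """
    points = list(zip(concentrations_um, percent_of_control))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 50.0 >= r_hi and r_lo != r_hi:
            frac = (r_lo - 50.0) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


def is_potential_lead(ic50_um, threshold_um=10.0):
    """Apply the 'IC50 less than 10 µM' criterion from the text."""
    return ic50_um is not None and ic50_um < threshold_um


# Hypothetical eight-point dose-response data (not taken from the document).
conc = [0.001, 0.01, 0.03, 0.1, 0.3, 1.0, 3.0, 10.0]
resp = [98.0, 90.0, 75.0, 60.0, 40.0, 20.0, 10.0, 5.0]

ic50 = ic50_by_interpolation(conc, resp)
print(ic50, is_potential_lead(ic50))
```

Interpolation is sensitive to noise in the two bracketing points, so it is best treated as a quick screen rather than a replacement for curve fitting.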

SCREENING CANDIDATE COMPOUNDS 

Following the protocols set forth above, one compound, 103487 (Example 8, supra) 
evidenced the following results: 



Figure 15A (Compound 103487): -1%; 31%; 2100; 46; 52; 850; 90 



Based upon these results, structure-activity analysis of the 103487 compound 
suggested that a series of derivatives of 3-(4-bromo-1-methylpyrazole-3-yl)phenylamine 
would exhibit similar 5-HT2A activity and selectivity. A series of derivatives of 
3-(4-bromo-1-methylpyrazole-3-yl)phenylamine has now been synthesized. These "directed" 
library compounds (Tripos, Inc.) were then analyzed in accordance with the protocols of 
Examples 9c(1), 9c(2) and 9d. 

This series of compounds exhibits highly selective 5-HT2A activity. Accordingly, in 
the first aspect of the invention, a series of compounds possessing 5-HT2A receptor activity 
that are useful as inverse agonists at such receptors is designated by the general formula 
(A): 




(A) 

Wherein: 






W is lower alkyl (C1-6), or halogen; 
V is lower alkyl (C1-6), or halogen; 
X is either Oxygen or Sulfur; 
Y is NR2R3, or (CH2)mR4, or O(CH2)nR4; 
Z is lower alkyl (C1-6); 
m = 0-4 
n = 0-4 

R1 is H or lower alkyl (C1-4); 
R2 is H or lower alkyl (C1-4); 

R3 and R4 are independently a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH, 
OMe, OEt, CONR5R6, NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, 
COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 
cycloalkyl, C1-6 alkyl, aryl, and aryloxy wherein each of the C3-6 cycloalkyl, C1-6 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH, 
OMe, OEt, CONR5R6, NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, 
SO2NR5R6, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, 
C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl; 

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or 
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe, 
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, 
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the 
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCl3, Me, 
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7, 
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, 
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl, 

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
to four substituents in any position independently selected from CF3, CCl3, Me, 
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt, 
COMe, or halogen; 

R7 may be independently selected from H or C1-6 alkyl; 

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or 
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7, 
SO3R7, COEt, NHCOCH3, or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic 
non-heterocyclic ring or a polycycle; 

C1-6 alkyl moieties can be straight chain or branched; 

optionally substituted C1-6 alkyl moieties can be straight chain or branched; 
C2-6 alkenyl moieties can be straight chain or branched; and 
optionally substituted C2-6 alkenyl moieties can be straight chain or 
branched. 

Examples of suitable C1-6 alkyl groups include but are not limited to methyl, 
ethyl, n-propyl, i-propyl, n-butyl, and t-butyl. 
Halogens are typically F, Cl, Br, and I. 

Examples of 5 or 6 membered ring moieties include, but are not restricted 
to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl, triazolyl, 
pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include, 
but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl, 
isoquinolyl, indolyl, quinoxalinyl, quinazolinyl and benzothienyl. 

A more preferred series of compounds possessing 5-HT2A receptor activity that are 
useful as inverse agonists at such receptors is designated by the general formula (B): 




(B) 

Wherein: 

W is Me, or Et, or halogen; 

X is either Oxygen or Sulfur; 

Y is NR2R3, or (CH2)mR4, or O(CH2)nR4; 

Z is lower alkyl (C1-6); 

m = 0-4 

n = 0-4 

R1 is H or lower alkyl (C1-4); 
R2 is H or lower alkyl (C1-4); 

R3 and R4 are independently a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH, 
OMe, OEt, CONR5R6, NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, 
COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 
cycloalkyl, C1-6 alkyl, aryl, and aryloxy wherein each of the C3-6 cycloalkyl, C1-6 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH, 
OMe, OEt, CONR5R6, NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, 
SO2NR5R6, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, 
C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl; 

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or 
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe, 
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, 
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the 
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCl3, Me, 
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7, 
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, 
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl, 

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
to four substituents in any position independently selected from CF3, CCl3, Me, 
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt, 
COMe, or halogen; 

R7 may be independently selected from H or C1-6 alkyl; 

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or 
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7, 
SO3R7, COEt, NHCOCH3, or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic 
non-heterocyclic ring or a polycycle; 

C1-6 alkyl moieties can be straight chain or branched; 

optionally substituted C1-6 alkyl moieties can be straight chain or branched; 
C2-6 alkenyl moieties can be straight chain or branched; and 
optionally substituted C2-6 alkenyl moieties can be straight chain or 
branched. 

Examples of suitable C1-6 alkyl groups include but are not limited to methyl, 
ethyl, n-propyl, i-propyl, n-butyl, and t-butyl. 
Halogens are typically F, Cl, Br, and I. 

Examples of 5 or 6 membered ring moieties include, but are not restricted 
to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl, triazolyl, 
pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include, 
but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl, 
isoquinolyl, indolyl, quinoxalinyl, quinazolinyl and benzothienyl. 

A first series of compounds having 5-HT2A receptor activity is represented by a 
class (I) of compounds of formula (B) wherein Y = NR2R3: 




Wherein: 

Preferably R1 and R2 are H. 
Preferably W is Br. 
Preferably X is O. 
Preferably Z is Me. 

Preferably R3 is 4-trifluoromethoxyphenyl or 4-trifluoromethoxybenzyl. 
Preferred compounds are: 

103487 

N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl}amino]carboxamide 

116115 

N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl)meth^ 

These two compounds demonstrated the following activities using the assay 
protocols defined in the Examples above: 

| Compound Number | Competitive Binding, AP-1 ([3H]mesulergine), IC50 Value (µM) | Competitive Binding, WT 5HT2A ([3H]LSD), IC50 Value (µM) | Inositol Phosphate Accumulation, AP-3, IC50 Value (µM) |
|---|---|---|---|
| 103487 | 2.1 | .046 | .052 |
| 116115 | 1.2 | .45 | .0171 |



Additional compounds of formula (B) wherein Y = NR2R3 are set forth below. 
Inositol phosphate accumulation assays evidence the activity of test compounds. Both single 
concentration percentages of control values and IC50 determinations indicate activity. In the 
tables below the column legends have the following meanings: 

IP3 % Control: The values in this column reflect an IP Accumulation Assay where the 
test compounds were evaluated at one concentration of 10 µM. For these assays, the compound 
was diluted into inositol-free Dulbecco's Eagle Media containing 10 µM pargyline and 10 mM 
LiCl and tested at a final assay concentration of 10 µM, in triplicate. The percent control value 
was calculated based on the control in which no test compound was added. 
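
The percent-of-control figure described above is a ratio of the mean triplicate signal to the mean no-compound control signal. A minimal sketch follows; the counts are hypothetical, and no background subtraction is applied because the text does not describe one.

```python
from statistics import mean


def percent_of_control(test_wells, control_wells):
    """Mean signal with compound as a percentage of the no-compound control."""
    return 100.0 * mean(test_wells) / mean(control_wells)


# Hypothetical scintillation counts (not taken from the document).
triplicate = [160, 150, 170]      # wells with 10 µM test compound
control = [1000, 1000, 1000]      # wells with no test compound

print(percent_of_control(triplicate, control))
```

A low percentage indicates strong suppression of inositol phosphate accumulation relative to the untreated control.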

AP-3 IC50 nM: The values in this column reflect an IP accumulation assay in which 
the test compound was evaluated at several different concentrations whereby an IC50 could be 
determined. This column corresponds to the column appearing in the tables above which is 
labeled: Inositol Phosphate Accumulation, AP-3, IC50 Value (µM). 

WT 5HT2A LSD IC50 nM: The values in this column reflect a competitive binding 
assay using LSD. This column corresponds to the column appearing in the tables above 
which is labeled: Competitive Binding, WT 5HT2A, ([3H]LSD), IC50 Value (µM). 
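
Note that the tables above report IC50 values in µM, while the tables below report them in nM. A small conversion sketch is given here for comparing the two; the helper name is hypothetical, and the example values are the 103487 entries from the table above.

```python
def um_to_nm(value_um):
    """Convert a concentration from micromolar to nanomolar."""
    return value_um * 1000.0


# Compound 103487, values in µM as listed in the earlier table.
ic50_um = {"AP-1 mesulergine": 2.1, "WT 5HT2A LSD": 0.046, "AP-3 IP": 0.052}

for assay, value in ic50_um.items():
    print(assay, round(um_to_nm(value), 3), "nM")
```

Rounding is applied only for display, to hide floating-point noise in the converted values.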

Compounds listed in each of the following tables reference the structures 
immediately preceding the table. A "dash" in the table indicates that no value was 
determined. 
| Compound No. | Name | R1 | R2 | R3 | R4 | X | U | IP3 % of Control | IP3 AP-3 IC50 nM | WT 5HT2A LSD IC50 nM |
|---|---|---|---|---|---|---|---|---|---|---|
| 116079 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][(4-methylthiophenyl)amino]carboxamide | SCH3 | H | H | H | O | NH | 16 | 17 | 4 |
| 116081 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][(4-chlorophenyl)amino]carboxamide | Cl | H | H | H | O | NH | 10 | 3.2 | 11 |
| 116082 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-fluorophenyl)carboxamide | F | H | H | H | O | NH | 11 | - | 7 |
| 116087 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-[2-(trifluoromethoxy)phenyl]carboxamide | H | H | CF3O | H | O | NH | 11 | - | 200 |
| 116089 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-nitrophenyl)carboxamide | H | H | NO2 | H | O | NH | 27 | - | 238 |
| 116091 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-methoxyphenyl)carboxamide | MeO | H | H | H | O | NH | 12 | - | 19 |
| 116092 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-methylphenyl)carboxamide | H | H | Me | H | O | NH | 32 | - | 131 |
| 116097 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-[4-(trifluoromethyl)phenyl]carboxamide | CF3 | H | H | H | O | NH | 11 | - | 65 |
| 116105 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3-chlorophenyl)carboxamide | H | Cl | H | H | O | NH | 11 | - | 39 |
| 116108 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-chlorophenyl)carboxamide | H | H | Cl | H | O | NH | 6 | - | 249 |
| 116110 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-[4-(methylethyl)phenyl]carboxamide | isopropyl | H | H | H | O | NH | 7 | - | 338 |
| 116111 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3-methoxyphenyl)carboxamide | H | MeO | H | H | O | NH | 7 | - | 106 |
| 116112 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3-methylphenyl)carboxamide | H | Me | H | H | O | NH | 14 | - | 57 |
| 116113 | [{3-(4-bromo-1-methylpyrazol-3-yl)phenyl}amino]-N-methyl-N-[4-(trifluoromethoxy)phenyl]carboxamide | CF3O | H | H | H | O | NCH3 | - | 193 | 2 |
| 116119 | N-[4-(tert-butyl)phenyl]{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carboxamide | t-butyl | H | H | H | O | NH | 17 | - | 476 |
| 116122 | N-[4-(dimethylamino)phenyl]{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carboxamide | NMe2 | H | H | H | O | NH | 9 | - | 309 |
| 116138 | N-(3,5-dichloro-4-methylphenyl){[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carboxamide | Me | Cl | H | Cl | O | NH | 23 | - | 122 |
| 116139 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-[4-(trifluoromethylthio)phenyl]carboxamide | CF3S | H | H | H | O | NH | 12 | - | 56 |
| 116144 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-fluorophenyl)carboxamide | H | H | F | H | O | NH | 12 | - | 37 |
| 116145 | 2-({[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carbonylamino)benzamide | H | H | CONH2 | H | O | NH | 31 | - | 7473 |
| 116147 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-cyanophenyl)carboxamide | CN | H | H | H | O | NH | 12 | - | 2 |
| 116148 | {[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-cyanophenyl)carboxamide | H | H | CN | H | O | NH | 30 | - | 348 |



| Compound No. | Name | IP3 AP-3 IC50 nM | WT 5HT2A LSD IC50 nM |
|---|---|---|---|
| 116141 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][cyclohexylamino]carboxamide | 114 | 81 |




| Compound No. | Name | R1 | R2 | R3 | R4 | R5 | IP3 AP-3 IC50 nM | WT 5HT2A LSD IC50 nM |
|---|---|---|---|---|---|---|---|---|
| 116143 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][phenylmethylamino]carboxamide | H | H | H | H | H | 120 | 47 |
| 116182 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-fluorophenyl)methyl}amino]carboxamide | F | H | H | H | H | 89 | 132 |
| 116183 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(3,4-dimethoxyphenyl)methyl}amino]carboxamide | OMe | OMe | H | H | H | - | 1010 |
| 116184 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(3,4,5-trimethoxyphenyl)methyl}amino]carboxamide | OMe | OMe | H | OMe | H | - | 2960 |
| 116185 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(2-methylphenyl)methyl}amino]carboxamide | H | H | Me | H | H | - | 769 |
| 116189 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-methoxyphenyl)methyl}amino]carboxamide | OMe | H | H | H | H | - | 102 |




| Compound No. | Name | R1 | R2 | R3 | R4 | R5 | IP3 AP-3 IC50 nM | WT 5HT2A LSD IC50 nM |
|---|---|---|---|---|---|---|---|---|
| 116194 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{2-(4-methoxyphenyl)ethy | OMe | H | H | H | H | 32 | 61 |



A second series of compounds having 5-HT2A receptor activity is represented by a 
class (II) of compounds of formula (B) wherein Y = O(CH2)nR4: 




Wherein: 

Preferably R1 is H. 
Preferably W is Br. 
Preferably X is O. 
Preferably Z is Me. 

Preferably when n = 0, R4 is 4-methoxyphenyl or tertiary butyl. 
Preferred compounds are: 

116100 

N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-methoxyphenoxy]carboxamide 

116192 

(tert-butoxy)-N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]carboxamide 




These two compounds demonstrated the following activity: 

| Compound No. | Competitive Binding, AP-1 ([3H]mesulergine), IC50 Value (µM) | Competitive Binding, WT 5HT2A ([3H]LSD), IC50 Value (µM) | Inositol Phosphate Accumulation, AP-3, IC50 Value (µM) |
|---|---|---|---|
| 116100 | 1.8 | <0.001 | 0.0003 |
| 116192 | - | 0.014 | 0.057 |



In addition to the assays discussed above, the specific activity of 116100 at the 
5HT2A receptor was further confirmed by the following. 

In Vitro Binding of 5HT2A Receptor 

Animals: 

Animals (Sprague-Dawley rats) were sacrificed and brains were rapidly dissected 
and frozen in isopentane maintained at -42°C. Horizontal sections were prepared on a 
cryostat and maintained at -20°C. 

LSD Displacement Protocol: 

Lysergic acid diethylamide (LSD) is a potent 5HT2A receptor and dopamine D2 
receptor ligand. An indication of the selectivity of compounds for either or both of these 
receptors involves displacement of radiolabeled-bound LSD from pre-treated brain sections. 
For these studies, radiolabeled I125-LSD (NEN Life Sciences, Boston, MA, Catalogue 
number NEX-199) was utilized; spiperone (RBI, Natick, MA, Catalogue number s-128), a 
5HT2A receptor and dopamine D2 receptor antagonist, was also utilized. Buffer consisted 
of 50 nanomolar TRIS-HCl, pH 7.4. 

Brain sections were incubated in (a) Buffer plus 1 nanomolar I125-LSD; (b) Buffer 
plus 1 nanomolar I125-LSD and 1 micromolar spiperone; or (c) Buffer plus 1 nanomolar 
I125-LSD and 1 micromolar 116100 for 30 minutes at room temperature. Sections were then 
washed 2 x 10 minutes at 4°C in Buffer, followed by 20 seconds in distilled H2O. Slides 
were then air-dried. 

After drying, sections were apposed to x-ray film (Kodak Hyperfilm) and exposed 
for 4 days. 
Analysis: 

Figures 16A-C provide representative autoradiographic sections from this study. 
Figure 16A evidences darker bands (derived from I125-LSD binding) primarily in both the 
fourth layer of the cerebral cortex (primarily 5HT2A receptors), and the caudate nucleus 
(primarily dopamine D2 receptors and some 5HT2A receptors). As can be seen from Figure 
16B, spiperone, which is a 5HT2A and dopamine D2 antagonist, displaces the I125-LSD from 
these receptors on both the cortex and the caudate. As can be further seen from Figure 16C, 
116100 appears to selectively displace the I125-LSD from the cortex (5HT2A) and not the 
caudate (dopamine D2). 

A third series of compounds having 5-HT2A receptor activity is represented by a 
class (III) of compounds of formula (B) wherein Y = (CH2)mR4: 




Wherein: 

Preferably W is Br. 
Preferably X is O. 
Preferably Z is Me. 
Preferably R1 is H. 

Preferably when m = 0, R4 is 4-trifluoromethoxyphenyl, or 
thiophene, or 4-chlorophenyl. 



Preferred compounds are: 

116101 

m = 0, R1 = H, R4 = 4-trifluoromethoxyphenyl 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-trifluoromethoxyphenyl]carboxamide 

116102 

m = 0, R1 = H, R4 = thiophene 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][2-thienyl]carboxamide 

116120 

m = 0, R1 = H, R4 = 4-chlorophenyl 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-chlorophenyl]carboxamide 

These three compounds demonstrated the following activities: 

| Compound Number | Competitive Binding, AP-1 ([3H]mesulergine), IC50 Value (µM) | Competitive Binding, WT 5HT2A ([3H]LSD), IC50 Value (µM) | Inositol Phosphate Accumulation, AP-3, IC50 Value (µM) |
|---|---|---|---|
| 116101 | 6.1 | .46 | 0.0213 |
| 116102 | 2.8 | .17 | 0.080 |
| 116120 | 1.2 | .21 | 0.0315 |



In Vivo Analysis of Compound 116102 

In addition to the in vitro assays shown in the above table, the in vivo response of 
animals to the 116102 compound is demonstrated by the following. 

A 5HT2A receptor antagonist or inverse agonist is expected to decrease 
amphetamine-stimulated locomotion without affecting baseline locomotion. See, for 
example, Sorensen, et al., 266(2) J. Pharmacol. Exp. Ther. 684 (1993). Based upon the 
foregoing information, Compound 116102 is a potent inverse agonist at the human 5HT2A 
receptor. For the following study, the following parameters and protocol were utilized: 

Animals, Vehicle 

Adult male Sprague-Dawley rats were utilized for these studies. Animals were 
housed in groups of 2-3 in hanging plastic cages with food and water available at all times. 
Animals were weighed and handled for at least one day prior to surgery and throughout the 
studies. For these studies, Vehicle consisted of 90% ethanol (100%) and 10% water. 

Amphetamine-stimulated locomotor activity: Assessment and Apparatus 

A San Diego Instruments Flex Field apparatus was used to quantify baseline and 
amphetamine-stimulated locomotor activity. This apparatus consists of four 16" x 16" clear 
plastic open fields. Photocell arrays (16 in each dimension) interfaced with a personal 
computer to automatically quantify activity. Several measures of activity can be assessed 
with the apparatus, including total photocell beam breaks. Animals (vehicle control and 
Compound treated) were injected s.c. 30 minutes prior to initiation of analysis. Following 
this 30 minute period, animals were placed individually into an open field and baseline 
activity was assessed for 30 minutes (habituation phase). Following baseline, animals were 
removed, injected with d-amphetamine sulfate (1.0 mg/kg) and immediately returned to the 
open field for 150 minutes, in order to follow the time course (10 minute intervals) of 
amphetamine-stimulated locomotor activity. 
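
The time course described above amounts to counting photocell beam breaks in consecutive 10 minute intervals over the 150 minute session. A minimal sketch of that binning is shown here; the event timestamps and the function name are hypothetical, since the actual output format of the apparatus is not described.

```python
def bin_beam_breaks(break_times_min, session_min=150, bin_min=10):
    """Count beam breaks per interval; times are minutes from session start."""
    n_bins = session_min // bin_min
    counts = [0] * n_bins
    for t in break_times_min:
        if 0 <= t < session_min:          # ignore events outside the session
            counts[int(t // bin_min)] += 1
    return counts


# Hypothetical beam-break timestamps for one animal.
events = [0.5, 3.0, 9.9, 10.0, 25.0, 149.9, 150.0]
counts = bin_beam_breaks(events)
print(len(counts), counts)
```

Each animal yields one list of 15 counts, which can then be averaged across the animals in a dose group at each interval.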



Dosing 

| Vehicle Control | Compound 116102 | Dose (mg/kg) |
|---|---|---|
| 6 animals | 6 animals | 0.1 |
| | 6 animals | 1.0 |
| | 6 animals | 5.0 |
| | 6 animals | 10.0 |



Analysis 

Results, based upon the number of recorded photobeam breaks (mean +/- sem), are 
presented in Figures 17A-C. As supported by Figures 17A, B and C, a general "inverted U" 
shaped pattern was observed (see, generally, Sahgal, A. "Practical behavioural 
neuroscience: problems, pitfalls and suggestions" pp 1-8, 5 in Behavioral Neuroscience: A 
Practical Approach, Volume 1, A. Sahgal (Ed.) 1993, IRL Press, New York). As Figure 17 
also indicates, with the exception of the highest dose (10 mg/kg), the tested doses of 
Compound 116102 evidenced a decrease in amphetamine-stimulated locomotion in vivo, 
consistent with a 5HT2A receptor antagonist or inverse agonist. 
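
The results above are reported as mean +/- sem of photobeam breaks. For reference, the sketch below shows how such a summary could be computed for one dose group of six animals; the counts are hypothetical and the sample standard deviation (n - 1 denominator) is assumed.

```python
import math
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, standard error of the mean) for one group of animals."""
    m = mean(values)
    sem = stdev(values) / math.sqrt(len(values))  # sample SD over sqrt(n)
    return m, sem


# Hypothetical beam-break totals for a group of six animals.
group = [10, 12, 14, 16, 18, 20]
m, sem = mean_sem(group)
print(m, round(sem, 3))
```

With six animals per group, the sem is the sample standard deviation divided by the square root of six.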

Additional compounds of formula (B) wherein Y = (CH2)mR4 are set forth below. 




| Compound No. | Name | R1 | R2 | R3 | R4 | IP3 IC50 nM | LSD IC50 nM |
|---|---|---|---|---|---|---|---|
| 116137 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-[4-(trifluoromethoxy)phenyl]acetamide | OCF3 | H | H | H | - | 106 |
| 116174 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(3-fluorophenyl)acetamide | H | F | H | H | 153 | 318 |
| 116175 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(3-methoxyphenyl)acetamide | H | OMe | H | H | 108 | 625 |
| 116176 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(2-fluorophenyl)acetamide | H | H | F | H | 129 | 662 |
| 116177 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(4-nitrophenyl)acetamide | NO2 | H | H | H | 61 | 108 |
| 116178 | N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(2-methoxyphenyl)acetamide | H | H | OMe | H | 165 | 2300 |



compound names not provided 



Based upon the discovery of the specific inverse agonist activity of the above 
identified compounds at the 5HT2A receptor, a novel class of compounds has been identified 
which exhibits said activity. Accordingly, in the second aspect of the invention, there is 
provided a novel compound of formula (C): 







(C) 



Wherein: 

W is Me, or Et, or halogen; 

X is either Oxygen or Sulfur; 

Y is NR 2 R 3 , or (CH 2 ) m R 4 , or 0(CH 2 ) n R 4 ; 

Z is lower alkyl (Ci_ 6 ); 

m = 0-4; 

n - 0-4; 

R 1 is H or lower alkyl (Cj^X 
R 2 is H or lower alkyl(Ci_4); 

R 3 is a Ci-6 alkyl, or C 2 . 6 alkenyl, or cycloalkyl, or (CH 2 )karyl group (k = 1 - 
4), preferably k = 1, and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF3, CC1 3 , Me, N0 2 , OH, 
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OMe, OEt, CONR 5 R 6 , NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe, 
COEt, CO-lower alkyl, SCF 3 CN, C 2 _ 6 alkenyl, H, halogens, C,_ 4 alkoxy, C 3 _ 6 
cycloalkyl, Ci- 6 alkyl, aryl, and aryloxy wherein each of the C 3 . 6 cycloalkyl, Cm 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, 
OMe, OEt, CONR 5 R 6 , NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 , SO a R 7 , 
S0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, H, halogens, Ci- 
4 alkoxy, C 3 . 6 cycloalkyl, Cm alkyl, and aryl; 

R 4 is a Ci-6 alkyl, or C 2 _6 alkenyl, or cycloalkyl, or aryl group and each said 
group may be optionally substituted by up to four substituents in any position 
independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe, COEt, CO-lower alkyl, 
SCF 3 CN, C 2 _ 6 alkenyl, H, halogens, Cu alkoxy, C 3 . 6 cycloalkyl, Cm alkyl, aryl, and 
aryloxy wherein each of the C 3 . 6 cycloalkyl, alkyl, aryl, or aryloxy groups may 
be further optionally substituted by up to four substituents in any position 
independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 , S0 3 R 7 , S0 2 NR 5 R 6 , COMe, COEt, CO- 
lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, H, halogens, Cm alkoxy, C 3 . 6 cycloalkyl, Cm 
alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or C 2 . 6 alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH 3 , OCF 3 , SMe, 
COOR 9 , S0 3 R 7 , S0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, 
H, halogens, Cm alkoxy, C 3 . 6 cycloalkyl, Cm alkyl, and aryl wherein each of the 
C 3 _6 cycloalkyl, Cm alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF 3 , CC1 3 , Me, 
N0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH 3 , OCF 3 , SMe, COOR 7 , 
S0 2 NR 8 R 9 , S0 3 R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, H, 
halogens, Cm alkoxy, C 3 - 6 cycloalkyl, Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
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to four substituents in any position independently selected from CF 3? CC1 3 , Me, 
N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 , S0 3 R 7 , NHCOCH3, COEt, 
COMe, or halogen; 

R 7 may be independently selected from HorCj-6 alkyl; 

R 8 and R 9 are independently a H, or Ci_ 6 alkyl, or C 2 -6 alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
S0 3 R 7 , COEt, NHCOCH 3 , or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up
to 4 heteroatoms independently selected from N, O, or S) or a 6 membered aromatic
non-heterocyclic ring or a polycycle;

C1-6 alkyl moieties can be straight chain or branched; 

optionally substituted C1-6 alkyl moieties can be straight chain or branched;
C2-6 alkenyl moieties can be straight chain or branched; and
optionally substituted C2-6 alkenyl moieties can be straight chain or
branched;

with the proviso that said compound is not: 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][methylamino]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl}amino]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][2-chlorophenyl]carboxamide, or
N-[3-(4-bromo-l-methylpyrazol-3^ or 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][trichloromethyl]carboxamide.

Examples of suitable C1-6 alkyl groups include but are not limited to methyl,
ethyl, n-propyl, i-propyl, n-butyl, and t-butyl.
Halogens are typically F, Cl, Br, and I.

Examples of 5 or 6 membered ring moieties include, but are not restricted
to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl, triazolyl,
pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include,
but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl,
isoquinolyl, indolyl, quinoxalinyl, quinazolinyl and benzothienyl.
Synthetic Approaches
The compounds disclosed in this invention may be readily prepared according to a
variety of synthetic manipulations, all of which would be familiar to one skilled in the art. In
the general syntheses set forth below, the labeled substituents have the same identifications as
set out in the definitions of the compounds above.

Compounds of general formula (I) can be obtained via a variety of synthetic routes all
of which would be familiar to one skilled in the art. The reaction of isocyanates with amines is
a commonly practised method for the formation of ureas (see Org. Syn. Coll. Vol. V, (1973),
555). Amine (IV), 3-(4-bromo-1-methylpyrazole-3-yl)phenylamine [commercially available
from Maybridge Chemical Company, Catalog No. KM01978, CAS No. 175201-77-1], reacts
readily with isocyanates (V) in inert solvents such as halocarbons to yield the desired ureas of
general formula (I) wherein R1 = R2 = H:




Alternatively the amine (IV) can be converted to the corresponding isocyanate (VI)
by the action of phosgene or a suitable phosgene equivalent, e.g. triphosgene, in an inert
solvent such as a halocarbon in the presence of an organic base such as triethylamine or
ethyldiisopropylamine. Isocyanate (VI) reacts with amines of general formula (VII), in an
analogous fashion to that described above for the reaction of (IV) with (V), yielding the
desired ureas of general formula (I) wherein R1 = H:

[Reaction scheme: (IV) to (VI), then (VI) + (VII) to give (I), R1 = H]



Alternatively wherein the isocyanate of general formula (V) is not commercially available it
can be prepared from the corresponding amine of general formula (VIII) in an analogous procedure
to that described above for the preparation of (VI). Reaction of these isocyanates with (IV) would
again yield the requisite ureas of general formula (I) wherein R1 = R2 = H:

[Reaction scheme: (VIII) to (V), then (V) + (IV) to give (I), R1 = R2 = H]



Amines of general formula (VII) are also readily converted to activated isocyanate
equivalents of general formula (IX) by the sequential action of carbonyldiimidazole and methyl
iodide in tetrahydrofuran and acetonitrile respectively (R.A. Batey et al, Tetrahedron Lett., (1998),
39, 6267-6270). Reaction of (IX) with (IV) in an inert solvent such as a halocarbon would yield the
requisite ureas of general formula (I) wherein R1 = H:

[Reaction scheme: (VII) to (IX), then (IX) + (IV) to give (I), R1 = H]

Amine (IV) may be monomethylated according to the procedure of J. Barluenga et al, J.
Chem. Soc., Chem. Commun., (1984), 20, 1334-1335, or alkylated according to the procedure of P.
Marchini et al, J. Org. Chem., (1975), 40(23), 3453-3456, to yield compounds of general formula
(X) wherein R1 = lower alkyl. These materials may be reacted as above with reagents of general
formula (V) and (IX) as depicted below:

[Reaction scheme: (X), R1 = lower alkyl, + (V) or (IX) to give (I), R1 = lower alkyl]

Compounds of general formula (II) can similarly be obtained via a variety of synthetic
manipulations, all of which would be familiar to one skilled in the art. The reaction of amine (IV)
with chloroformates (see Org. Syn. Coll. Vol. IV, (1963), 780) of general formula (XI) in an inert
solvent such as ether or halocarbon in the presence of a tertiary base such as triethylamine or
ethyldiisopropylamine readily yields the requisite carbamates of general formula (II) wherein R1 =
H. Analogously amines of general formula (X) react similarly with chloroformates (XI) to yield the
requisite carbamates of general formula (II) wherein R1 = lower alkyl:

[Reaction scheme: amine (X), R1 = lower alkyl, + chloroformate (XI) to give carbamate (II), R1 = lower alkyl]



An alternative route employs the ready reaction of an alcohol with an isocyanate. Thus
isocyanate (VI) described previously reacts readily with alcohols (XII) in an aprotic solvent such as
ether or chlorocarbon to yield the desired carbamates of general formula (II) wherein R1 = H:

Chloroformates of general formula (XI) not commercially available may be readily prepared
from the corresponding alcohol (XII) in an inert solvent such as toluene, chlorocarbon or ether by
the action of excess phosgene (see Org. Syn. Coll. Vol. III, (1955), 167):

[Reaction scheme: alcohol (XII) + phosgene to give chloroformate (XI)]



Compounds of general formula (III) can be obtained via a variety of synthetic
manipulations, all of which would be familiar to one skilled in the art. The reaction of amine (IV)
with acid chlorides (see Org. Syn. Coll. Vol. V, (1973), 336) of general formula (XIII) to yield the
desired amides (III) wherein R1 = H is readily achieved in an inert solvent such as chloroform or
dichloromethane in the presence of an organic base such as triethylamine or ethyldiisopropylamine.
In an identical fashion amines of general formula (X) would react with acid chlorides (XIII) to yield
the desired amides (III) wherein R1 = lower alkyl:

[Reaction scheme: amine (X), R1 = lower alkyl, + acid chloride (XIII) to give amide (III), R1 = lower alkyl]



Alternatively the corresponding acids of general formula (XIV) may be coupled with
dicyclohexylcarbodiimide (DCC)/hydroxybenzotriazole (HOBT) (see W. Konig et al, Chem. Ber.,
(1970), 103, 788) or hydroxybenzotriazole (HOBT)/2-(1H-benzotriazole-1-yl)-1,1,3,3-
tetramethyluronium hexafluorophosphate (HBTU) (see M. Bernatowicz et al, Tetrahedron Lett.,
(1989), 30, 4645) as condensing agents in dimethylformamide or chloroform to amines (IV) and
(X) respectively yielding products identical to those described in the previous scheme:

The acids of general formula (XIV) are readily converted to the corresponding acid
chlorides (XIII) by the action of thionyl chloride or oxalyl chloride in the presence of catalytic
dimethylformamide:

[Reaction scheme: acid (XIV) + SOCl2 or (COCl)2, DMF cat., to give acid chloride (XIII)]

A third aspect of the present invention provides a compound of formula (A) or a solvate or
physiologically functional derivative thereof for use as a therapeutic agent, specifically as a
modifier of the activity of the serotonin 5-HT2A receptor. Modifiers of the activity of the serotonin
5-HT2A receptor are believed to be of potential use for the treatment or prophylaxis of CNS,
gastrointestinal, cardiovascular, and inflammatory disorders. Compounds of the formula (A) may
be administered by oral, sublingual, parenteral, rectal, or topical administration. In addition to the
neutral forms of compounds of formula (A), physiologically acceptable salts of the compounds may
also be formed, by appropriate addition of an ionizable substituent which does not alter the receptor
specificity of the compound, and used as therapeutic agents. Different amounts of the
compounds of formula (A) will be required to achieve the desired biological effect. The amount will
depend on factors such as the specific compound, the use for which it is intended, the means of
administration, and the condition of the treated individual. A typical dose may be expected to fall in
the range of 0.001 to 200 mg per kilogram of body weight of the treated individual. Unit doses may
contain from 1 to 200 mg of the compounds of formula (A) and may be administered one or more
times a day, individually or in multiples. In the case of the salt or solvate of a compound of
formula (A), the dose is based on the cation (for salts) or the unsolvated compound.
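
The dose ranges stated above translate into simple arithmetic. The following is a minimal illustrative sketch in Python, not part of the disclosed method: the function names and the 70 kg example weight are assumptions introduced here, and only the 0.001 to 200 mg/kg range and the 1 to 200 mg unit dose range come from the text.

```python
import math

# Ranges taken from the text; everything else is illustrative.
DOSE_LOW_MG_PER_KG = 0.001
DOSE_HIGH_MG_PER_KG = 200.0
UNIT_DOSE_MIN_MG = 1.0
UNIT_DOSE_MAX_MG = 200.0


def dose_range_mg(body_weight_kg):
    """Return (low, high) dose in mg for a given body weight in kg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (DOSE_LOW_MG_PER_KG * body_weight_kg,
            DOSE_HIGH_MG_PER_KG * body_weight_kg)


def unit_doses_needed(total_mg, unit_dose_mg):
    """Number of whole unit doses needed to supply at least total_mg."""
    if not UNIT_DOSE_MIN_MG <= unit_dose_mg <= UNIT_DOSE_MAX_MG:
        raise ValueError("unit dose outside the 1 to 200 mg range in the text")
    return math.ceil(total_mg / unit_dose_mg)


if __name__ == "__main__":
    low, high = dose_range_mg(70.0)  # hypothetical 70 kg individual
    print(f"dose range: {low:.3f} mg to {high:.0f} mg")
    print("unit doses of 100 mg for 350 mg:", unit_doses_needed(350.0, 100.0))
```

Such a calculation only restates the stated ranges; the appropriate amount still depends on the compound, the intended use, the route of administration and the condition of the individual.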



A fourth aspect of the present invention provides pharmaceutical compositions
comprising at least one compound of formula (A) and/or a pharmacologically acceptable
salt or solvate thereof as an active ingredient combined with at least one pharmaceutical
carrier or excipient. Such pharmaceutical compositions may be used in the treatment of
clinical conditions for which a modifier of the activity of the serotonin 5-HT2A receptor is
indicated. At least one compound of formula (A) may be combined with the carrier in either
solid or liquid form in a unit dose formulation. The pharmaceutical carrier must be
compatible with the other ingredients in the composition and must be tolerated by the
individual recipient. Other physiologically active ingredients may be incorporated into the
pharmaceutical composition of the invention if desired, and if such ingredients are
compatible with the other ingredients in the composition. Formulations may be prepared by
any suitable method, typically by uniformly mixing the active compound(s) with liquids or
finely divided solid carriers, or both, in the required proportions, and then, if necessary,
forming the resulting mixture into a desired shape.

Conventional excipients, such as binding agents, fillers, acceptable wetting agents,
tabletting lubricants, and disintegrants may be used in tablets and capsules for oral
administration. Liquid preparations for oral administration may be in the form of solutions,
emulsions, aqueous or oily suspensions, and syrups. Alternatively, the oral preparations may
be in the form of dry powder which can be reconstituted with water or another suitable liquid
vehicle before use. Additional additives such as suspending or emulsifying agents,
non-aqueous vehicles (including edible oils), preservatives, and flavorings and colorants may be
added to the liquid preparations. Parenteral dosage forms may be prepared by dissolving the
compound of the invention in a suitable liquid vehicle and filter sterilizing the solution before
filling and sealing an appropriate vial or ampoule. These are just a few examples of the many
appropriate methods well known in the art for preparing dosage forms.

A fifth aspect of the present invention provides for the use of a compound of
formula (A) in the preparation of a medicament for the treatment of a medical condition for
which a modifier of the activity of the serotonin 5-HT2A receptor is indicated.

A sixth aspect of the present invention provides for a method of treatment of a
clinical condition of a mammal, such as a human, for which a modifier of the activity of the
serotonin 5-HT2A receptor is indicated, which comprises the administration to the mammal
of a therapeutically effective amount of a compound of formula (A) or a physiologically
acceptable salt, solvate, or physiologically functional derivative thereof.

Experimental Data 

Mass spectra were recorded on a Micromass Platform LC with Gilson HPLC. Infra-red
spectra were recorded on a Nicolet Avatar 360 FT-IR. Melting points were recorded on
an Electrothermal IA9200 apparatus and are uncorrected. Proton nuclear magnetic resonance
spectra were recorded on a Bruker 300MHz machine. Chemical shifts are given with respect
to tetramethylsilane. In the text the following abbreviations are used: s (singlet), d (doublet),
t (triplet), m (multiplet) or combinations thereof. Chemical shifts are quoted in parts per
million (ppm) and coupling constants in Hertz.

Thin layer chromatography was carried out using aluminium backed silica plates
(250 µm; GF254). HPLC was recorded either on a HP Chemstation 1100 HPLC using a
Hichrom 3.5 C18 reverse phase column (50mm x 2.1mm i.d.), with linear gradient elution over
5 minutes from 95% water (+0.1% TFA) / 5% acetonitrile (+0.05% TFA) down to 5% water /
95% acetonitrile at a flow rate of 0.8mL/min [Method A]; or on a Hichrom 3.5 C18 reverse phase
column (100mm x 3.2mm i.d.), with linear gradient elution over 11 minutes from 95% water
(+0.1% TFA) / 5% acetonitrile (+0.05% TFA) down to 5% water / 95% acetonitrile at a flow
rate of 1mL/min [Method B]. Samples were routinely monitored at 254nm unless otherwise
stated.
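
As an illustration of the two linear gradients described above, the mobile-phase composition at any time point can be interpolated. This is a minimal sketch in Python; the function name is hypothetical, and holding the composition constant outside the gradient window is an assumption made here, not something stated in the text.

```python
# Gradient durations in minutes, as given for Method A and Method B.
GRADIENT_MINUTES = {"A": 5.0, "B": 11.0}


def percent_acetonitrile(t_min, method, start=5.0, end=95.0):
    """Percent acetonitrile at time t_min for a linear gradient.

    The gradient runs from `start` to `end` percent over the method's
    duration. Times before 0 or after the end are clamped (assumption).
    """
    duration = GRADIENT_MINUTES[method]
    fraction = min(max(t_min / duration, 0.0), 1.0)
    return start + (end - start) * fraction


if __name__ == "__main__":
    for t in (0.0, 2.5, 5.0):
        print(f"Method A, t = {t} min: {percent_acetonitrile(t, 'A'):.1f}% MeCN")
    print(f"Method B, t = 5.5 min: {percent_acetonitrile(5.5, 'B'):.1f}% MeCN")
```

The water fraction at the same time point is simply 100 minus the returned value.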

All reagents were purchased from commercial sources. 

Experiment 1 
Preparation and Analysis of 103487 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl}amino]carboxamide
This compound is commercially available from Maybridge Chemical Company, 
Catalog No. KM04515. 

Experiment 2 
Preparation and Analysis of 116100
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-methoxyphenoxy]carboxamide

To 4-methoxyphenylchloroformate (19mg, 0.10mmol) in CH2Cl2 (0.5mL) was added
dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (25mg, 0.10mmol) and
triethylamine (14µL, 0.10mmol) in CH2Cl2 (0.5mL). The mixture was stirred for 16 h and
concentrated. Chromatography on flash silica (40% EtOAc/hexane) gave the title compound as
a colourless solid (21mg, 52%), m.p. 140.3-141.8°C (EtOAc/hexane).

IR: νmax = 1748, 1592, 1504, 1412, 1190, 835, 764, 676 cm-1. MS (ES+): m/z (%) =
404 (M+H 81Br, 100), 402 (M+H 79Br, 90).

1H-NMR (CD3OD): δ = 3.80 (3H, s, CH3), 3.81 (3H, s, CH3), 6.91-6.98 (2H, m, ArH),
7.07-7.18 (3H, m, ArH), 7.42-7.53 (4H, m, ArH). HPLC: retention time 3.28 mins [Method
A]. TLC: Rf 0.4 (EtOAc/hexane).
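
The paired M+H peaks two mass units apart, reported for each compound, arise from the two stable bromine isotopes. A minimal sketch in Python of how the nominal values can be checked is given below. The molecular formula C18H16BrN3O3 for the title compound of Experiment 2 is derived here from its name and is an assumption, as are the helper names; the isotope masses are approximate literature values.

```python
import re

# Approximate monoisotopic masses (u); values rounded, for illustration only.
MONOISOTOPIC = {"C": 12.0, "H": 1.007825, "N": 14.003074,
                "O": 15.994915, "F": 18.998403}
BROMINE = {79: 78.9183, 81: 80.9163}
PROTON = 1.007276


def parse_formula(formula):
    """Turn a formula such as 'C18H16BrN3O3' into {'C': 18, 'H': 16, ...}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def mh_plus(formula, bromine_isotope):
    """m/z of the singly protonated ion for a compound with one Br atom."""
    counts = parse_formula(formula)
    if counts.pop("Br", 0) != 1:
        raise ValueError("this sketch handles exactly one bromine atom")
    mass = sum(MONOISOTOPIC[el] * n for el, n in counts.items())
    return mass + BROMINE[bromine_isotope] + PROTON


if __name__ == "__main__":
    formula = "C18H16BrN3O3"  # assumed formula for compound 116100
    for isotope in (79, 81):
        print(f"M+H ({isotope}Br): {mh_plus(formula, isotope):.2f}")
```

Rounded to the nearest integer, the two values agree with the nominal 402 and 404 quoted for Experiment 2.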

Experiment 3 
Preparation and Analysis of 116101 
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-trifluoromethoxyphenyl]carboxamide
To 4-(trifluoromethoxy)benzoyl chloride (19µL, 0.12mmol) in CH2Cl2 (1mL) was
added dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30mg,
0.12mmol) and triethylamine (17µL, 0.12mmol) in CH2Cl2 (0.5mL). The reaction mixture
was stirred for 16 h and concentrated. Chromatography on flash silica (50% EtOAc/hexane)
gave the title compound as a colourless solid (40mg, 76%), m.p. 138.6-139.6°C
(EtOAc/hexane).

MS (ES+): m/z (%) = 442 (M+H 81Br, 93), 440 (M+H 79Br, 100).
1H-NMR (DMSO d6): δ = 3.79 (3H, s, CH3), 7.27 (1H, m, ArH), 7.45-7.60 (3H, m,
ArH), 7.65 (1H, s, ArH), 7.87 (2H, m, ArH), 8.09 (2H, m, ArH), 10.51 (1H, s, NH).

HPLC: retention time 3.60 min [Method A]. TLC: Rf 0.40 (50% EtOAc/hexane).
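
The quoted percentage yields follow from the isolated mass, the molar mass of the product and the amount of limiting reagent. A minimal sketch in Python is given below. The formula C18H13BrF3N3O2 for the title compound of Experiment 3 is derived here from its name and is an assumption, as are the function names; the atomic masses are standard average values.

```python
import re

# Standard average atomic masses (g/mol), rounded.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999,
               "F": 18.998, "S": 32.06, "Cl": 35.45, "Br": 79.904}


def molar_mass(formula):
    """Molar mass in g/mol for a simple formula such as 'C10H10BrN3'."""
    total = 0.0
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[element] * (int(number) if number else 1)
    return total


def percent_yield(product_mg, product_formula, limiting_mmol):
    """Isolated yield in percent; mg divided by g/mol gives mmol."""
    product_mmol = product_mg / molar_mass(product_formula)
    return 100.0 * product_mmol / limiting_mmol


if __name__ == "__main__":
    # Experiment 3: 40 mg isolated from 0.12 mmol of amine.
    print(f"yield: {percent_yield(40.0, 'C18H13BrF3N3O2', 0.12):.0f}%")
    # Starting amine, assumed formula C10H10BrN3: 30 mg expressed in mmol.
    print(f"amine: {30.0 / molar_mass('C10H10BrN3'):.2f} mmol")
```

With these inputs the sketch gives a yield that rounds to the 76% quoted for Experiment 3, and 30 mg of the starting amine corresponds to about 0.12 mmol.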

Experiment 4 
Preparation and Analysis of 116102
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][2-thienyl]carboxamide
To thiophene-2-carbonyl chloride (11µL, 0.09mmol) in CH2Cl2 (1mL) was added
dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (25mg, 0.09mmol)
and triethylamine (14µL, 0.09mmol) in CH2Cl2 (0.5mL). The reaction mixture was stirred
for 16 h and concentrated. Chromatography on flash silica (50% EtOAc/hexane) gave the
title compound as a colourless solid (24mg, 68%), m.p. 127.8-128.6°C (EtOAc/hexane).
MS (ES+): m/z (%) = 364 (M+H 81Br, 96), 362 (M+H 79Br, 100).
1H-NMR (CD3OD): δ = 3.81 (3H, s, CH3), 7.19 (2H, m, ArH), 7.48-7.58 (2H, m,
ArH), 7.68-7.83 (3H, m, ArH), 7.93 (1H, dd, J=1.0, 3.8, ArH).

HPLC: retention time 3.12 min [Method A]. TLC: Rf 0.30 (30% EtOAc/hexane).



Experiment 5
Preparation and Analysis of 116115
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{((4-trifluoromethoxy)phenyl)methyl}amino]carboxamide
To a stirred solution of triphosgene (12mg, 0.04mmol) in CH2Cl2 (0.5mL) was
added dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30mg,
0.12mmol) and triethylamine (33µL, 0.24mmol) in CH2Cl2 (0.5mL). After 1 h,
4-(trifluoromethoxy)benzylamine (23mg, 0.12mmol) was added. The reaction mixture was
stirred for 16 h and concentrated. Chromatography on flash silica (75% EtOAc/hexane) gave
the title compound as a colourless solid (38mg, 68%), m.p. 144.6-145.8°C (EtOAc/hexane).
IR: νmax = 1626, 1558, 1278, 1160, 969, 871, 789, 703 cm-1. MS (ES+): m/z (%) = 471
(M+H 81Br, 91), 469 (M+H 79Br, 100).

1H-NMR (CD3OD): δ = 3.81 (3H, s, CH3), 4.42 (2H, s, CH2), 7.06 (1H, d, J=7.1, ArH),
7.24 (2H, d, J=8.4, ArH), 7.37-7.52 (6H, m, ArH). HPLC: retention time 3.06 mins [Method
A]. TLC: Rf 0.5 (EtOAc).

Experiment 6
Preparation and Analysis of 116120
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][4-chlorophenyl]carboxamide
To 4-chlorobenzoyl chloride (15mg, 0.08mmol) in CH2Cl2 (1mL) was added
dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (21mg, 0.08mmol)
and triethylamine (12µL, 0.08mmol) in CH2Cl2 (0.5mL). The mixture was stirred for 16 h
and concentrated. Chromatography on flash silica (50% EtOAc/hexane) gave the title
compound as a colourless solid (23mg, 72%), m.p. 184.4-184.8°C (EtOAc/hexane).

MS (ES+): m/z (%) = 394 (M+H 81Br 37Cl, 34), 392 (M+H 79Br 37Cl (81Br 35Cl),
100), 390 (M+H 79Br 35Cl, 67).

1H-NMR (DMSO d6): δ = 3.79 (3H, s, CH3), 7.25 (1H, d, J=7.9, ArH), 7.51-7.65
(3H, m, ArH), 7.69 (1H, s, ArH), 7.90 (2H, m, ArH), 8.00 (2H, m, ArH), 10.51 (1H, s, NH).
HPLC: retention time 3.40 min [Method A]. TLC: Rf 0.35 (50% EtOAc/hexane).
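
For compounds containing one bromine and one chlorine atom, three M+H peaks spaced two mass units apart are expected, with the middle peak combining the 79Br 37Cl and 81Br 35Cl species. The following minimal Python sketch estimates idealised relative intensities from approximate natural isotope abundances; the abundance values and the function name are assumptions introduced here, and measured intensities need not match such idealised figures.

```python
# Approximate natural abundances (fractions); rounded literature values.
BROMINE_ABUNDANCE = {79: 0.5069, 81: 0.4931}
CHLORINE_ABUNDANCE = {35: 0.7576, 37: 0.2424}


def br_cl_pattern():
    """Relative intensities (largest = 100) keyed by mass offset 0, 2, 4.

    Offset 0 is the 79Br 35Cl peak, offset 2 the combined 79Br 37Cl and
    81Br 35Cl peak, and offset 4 the 81Br 37Cl peak.
    """
    peaks = {}
    for br_mass, p_br in BROMINE_ABUNDANCE.items():
        for cl_mass, p_cl in CHLORINE_ABUNDANCE.items():
            offset = (br_mass - 79) + (cl_mass - 35)
            peaks[offset] = peaks.get(offset, 0.0) + p_br * p_cl
    largest = max(peaks.values())
    return {offset: 100.0 * p / largest for offset, p in sorted(peaks.items())}


if __name__ == "__main__":
    lowest_peak = 390  # nominal m/z of the 79Br 35Cl peak in Experiment 6
    for offset, intensity in br_cl_pattern().items():
        print(f"m/z {lowest_peak + offset}: {intensity:.0f}")
```

The sketch reproduces the qualitative shape reported for Experiment 6, with the middle peak as the base peak.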
Experiment 7
Preparation and Analysis of 116137
N-[3-(4-bromo- 1 -methy lpyrazol-3 ^ 

A solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (35mg, 0.14mmol) and
triethylamine (23µL, 0.17mmol) in DMF (0.5mL) was added in one portion to a stirred
solution of 4-trifluoromethoxyphenylacetic acid (31mg, 0.14mmol), HBTU (53mg,
0.14mmol) and HOBT (19mg, 0.14mmol) in DMF (1mL). The mixture was heated at 70°C
for 24 h and then quenched with aqueous sodium bicarbonate solution. Ethyl acetate was
added and the organic phase separated, washed with water (x3), brine, dried (MgSO4) and
evaporated. Chromatography on flash silica (50% EtOAc/hexane) gave the title compound
as a colourless solid (43mg, 68%), m.p. 141.2-142.5°C (EtOAc/hexane).

IR: νmax = 1684, 1592, 1510, 1253, 1217, 1157, 987, 798, 700 cm-1.

MS (ES+): m/z (%) = 456 (M+H 81Br, 100), 454 (M+H 79Br, 94).

1H-NMR (DMSO d6): δ = 3.72 (2H, s, CH2), 3.75 (3H, s, CH3), 7.17 (1H, d, J=7.7,
ArH), 7.33 (2H, d, J=8.7, ArH), 7.38-7.51 (3H, m, ArH), 7.62-7.73 (3H, m, ArH), 10.44
(1H, s, NH).

HPLC: retention time 3.52 min [Method A].

Experiment 8 
Preparation and Analysis of 116174
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(3-fluorophenyl)acetamide

A mixture of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30 mg, 0.12 mmol),
3-fluorophenylacetic acid (18 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12
mmol) and 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (46
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N,N-Diisopropylethylamine (0.02
ml, 0.13 mmol) was added and the mixture stirred at room temperature for 16h. The
reaction mixture was then poured into brine and the organic layer washed with further brine,
dried over magnesium sulphate and then concentrated in vacuo. The crude product was
purified by column chromatography (ethyl acetate-toluene, 1:1), giving the title compound
(12 mg, 26 %). Rf 0.41 (ethyl acetate-toluene, 1:1).
HPLC (Method B): retention time 7.07 min (100 %). δH (CDCl3) 3.77 (2H, s), 3.83
(3H, s), 7.02 - 7.20 (4H, m), 7.54 (1H, s), 7.60 - 7.63 (1H, m). MS (AP+): m/z (%) = 390
(M + H 81Br, 100), 388 (M + H 79Br, 100).

Experiment 9
Preparation and Analysis of 116175
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(3-methoxyphenyl)acetamide

A solution of 3-methoxyphenylacetyl chloride (0.02 ml, 0.12 mmol) in
dichloromethane (0.75 ml) was added dropwise at 0 °C to a solution of 3-(3-aminophenyl)-
4-bromo-1-methylpyrazole (30 mg, 0.12 mmol) and triethylamine (0.02 ml, 0.13 mmol) in
dichloromethane (0.75 ml). The resulting mixture was stirred at room temperature for 16h
and then poured into brine. The organic layer was washed with more brine then dried over
magnesium sulphate and concentrated in vacuo. The crude product was purified by column
chromatography (ethyl acetate-toluene, 1:1), giving the title compound (9 mg, 19 %). Rf
0.30 (ethyl acetate-toluene, 1:1).

HPLC (Method B): retention time 8.62 min (97.09 %). δH (CDCl3) 3.76 (2H, s),
3.82 (3H, s), 3.85 (3H, s), 6.84 - 6.90 (3H, m), 7.07 - 7.44 (5H, m), 7.53 (1H, s), 7.60 (1H,
br s). MS (AP+): m/z (%) = 402 (M + H 81Br, 100), 400 (M + H 79Br, 95).

Experiment 10
Preparation and Analysis of 116176
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(2-fluorophenyl)acetamide

A mixture of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30 mg, 0.12 mmol),
2-fluorophenylacetic acid (18 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12
mmol) and 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (46
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N,N-Diisopropylethylamine (0.02
ml, 0.13 mmol) was added and the mixture stirred at room temperature for 16h. The
reaction mixture was then poured into brine and the organic layer washed with further brine,
dried over magnesium sulphate and then concentrated in vacuo. The crude product was
purified by column chromatography (ethyl acetate-toluene, 1:1), giving the title compound
(15 mg, 32 %). Rf 0.52 (ethyl acetate-toluene, 1:1).
HPLC (Method B): retention time 7.28 min (100 %). δH (CDCl3) 3.79 (2H, s), 3.83
(3H, s), 7.11 - 7.23 (3H, m), 7.30 - 7.55 (6H, m), 7.61 - 7.64 (1H, m). MS (AP+): m/z (%)
= 390 (M + H 81Br, 100), 388 (M + H 79Br, 100).

Experiment 11
Preparation and Analysis of 116177
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(4-nitrophenyl)acetamide

A mixture of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30 mg, 0.12 mmol),
4-nitrophenylacetic acid (22 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12
mmol) and 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (46
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N,N-Diisopropylethylamine (0.02 ml,
0.13 mmol) was added and the mixture stirred at room temperature for 16h. The reaction
mixture was then poured into brine and the organic layer washed with further brine, dried over
magnesium sulphate and then concentrated in vacuo. The crude product was purified by
column chromatography (ethyl acetate-toluene, 1:1), giving the title compound (9 mg, 18 %).
Rf 0.19 (ethyl acetate-toluene, 1:1).

HPLC (Method B): retention time 7.22 min (94.30 %). δH (CDCl3) 3.83 (3H, s), 3.87
(2H, s), 7.18 - 7.23 (1H, m), 7.42 - 7.65 (7H, m), 8.22 - 8.30 (2H, m). MS (AP+): m/z (%) =
417 (M + H 81Br, 100), 415 (M + H 79Br, 100).



Experiment 12
Preparation and Analysis of 116178
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]-2-(2-methoxyphenyl)acetamide

A mixture of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (30 mg, 0.12 mmol),
2-methoxyphenylacetic acid (20 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg,
0.12 mmol) and 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate
(46 mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N,N-Diisopropylethylamine
(0.02 ml, 0.13 mmol) was added and the mixture stirred at room temperature for
16h. The reaction mixture was then poured into brine and the organic layer washed with
further brine, dried over magnesium sulphate and then concentrated in vacuo. The crude
product was purified by column chromatography (chloroform-methanol, 99:1), giving the
title compound (18 mg, 38 %) as a colourless solid. Rf 0.65 (chloroform-methanol, 98:2).

HPLC (Method B): retention time 7.16 min (100 %). δH (CDCl3) 3.76 (2H, s), 3.83
(3H, s), 3.98 (3H, s), 6.97 - 7.06 (2H, m), 7.11 - 7.16 (1H, m), 7.31 - 7.50 (4H, m), 7.53
(1H, s), 7.57 - 7.60 (1H, m), 7.91 (1H, br s). MS (AP-): m/z (%) = 400 (M - H 81Br, 90),
398 (M - H 79Br, 100).

Experiment 13 
Preparation and Analysis of 116192
10 {[3-(4-bromo-l-methylpyrazol-3-yl)ph 

To di-tert-butyl dicarbonate (36mg, 0.17mmol) in methanol (1mL) was added
dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (42mg, 0.17mmol) in
methanol (1mL). The mixture was stirred for 16 h and concentrated. Chromatography on
flash silica (40% EtOAc/hexane) gave the title compound as a colourless solid (29mg, 49%)
(EtOAc/hexane).

MS (CI-): m/z (%) = 352 (M-H 81Br, 100), 350 (M-H 79Br, 96).
1H-NMR (DMSO d6): δ = 1.46 (9H, s, 3xCH3), 3.73 (3H, s, CH3), 7.07 (1H, m,
ArH), 7.42 (1H, t, J=7.7, ArH), 7.53-7.60 (2H, m, ArH), 7.64 (1H, s, ArH), 9.57 (1H, s,
NH).

HPLC: retention time 7.15 min [Method B].

One or the other (as indicated) of the two following synthetic protocols was used to
generate each of the compounds below:
Protocol A:

To an isocyanate (1mmol) in CH2Cl2 (4mL) was added dropwise a solution of
3-(3-aminophenyl)-4-bromo-1-methylpyrazole (1mmol) in CH2Cl2 (4mL). The mixture was
stirred for 16 hours and concentrated. Chromatography on flash silica (20%-80%
EtOAc/hexane) followed by recrystallisation gave the pure urea.
Protocol B:

To a stirred solution of triphosgene (0.33mmol) in CH2Cl2 (4mL) was added
dropwise a solution of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole (1mmol) and
triethylamine (2mmol) in CH2Cl2 (4mL). After 1 hour, an aniline was added (1mmol). The
reaction mixture was stirred for 16 hours and concentrated. Chromatography on flash silica
(20%-80% EtOAc/hexane) followed by recrystallisation gave the pure urea.
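
Protocol B is written per millimole of the starting amine, so the reagent amounts for another scale follow by proportion. The following minimal Python sketch illustrates this; the dictionary keys and the function name are hypothetical, and linear scaling of the solvent volume is an assumption made here for convenience.

```python
# Amounts per 1 mmol of 3-(3-aminophenyl)-4-bromo-1-methylpyrazole,
# as stated in Protocol B.
PROTOCOL_B_PER_MMOL = {
    "triphosgene_mmol": 0.33,
    "triethylamine_mmol": 2.0,
    "aniline_mmol": 1.0,
    "ch2cl2_ml_per_solution": 4.0,  # solvent scaling is an assumption
}


def scale_protocol_b(amine_mmol):
    """Scale the Protocol B quantities linearly to a given amount of amine."""
    if amine_mmol <= 0:
        raise ValueError("amount of amine must be positive")
    return {name: amount * amine_mmol
            for name, amount in PROTOCOL_B_PER_MMOL.items()}


if __name__ == "__main__":
    for name, amount in scale_protocol_b(0.12).items():
        print(f"{name}: {amount:.2f}")
```

At 0.12 mmol of amine the scaled reagent amounts (0.04 mmol triphosgene, 0.24 mmol triethylamine, 0.12 mmol of the second amine) correspond to those used in Experiment 5.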

Experiment 14 
Preparation and Analysis of 116079
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][(4-methylthiophenyl)amino]carboxamide

[Protocol A] - 4-(methylthio)phenyl isocyanate
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 419 (M+H 81Br, 100), 417 (M+H 79Br, 94).
1H-NMR (MeOH d4): δ = 2.42 (3H, s, SCH3), 3.81 (3H, s, NCH3), 7.06 (1H, m,
ArH), 7.22 (2H, m, ArH), 7.37 (2H, m, ArH), 7.42-7.61 (4H, m, ArH).
HPLC: retention time 3.35 min [Method A].

Experiment 15 

Preparation and Analysis of 116081

N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][(4-chlorophenyl)amino]carboxamide

[Protocol A] - 4-chlorophenyl isocyanate
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 409 (M+H 81Br 37Cl, 19), 407 (M+H 79Br 37Cl (81Br 35Cl),
100), 405 (M+H 79Br 35Cl, 81).

1H-NMR (MeOH d4): δ = 3.81 (3H, s, CH3), 7.07 (1H, m, ArH), 7.23 (2H, m, ArH),
7.36-7.60 (6H, m, ArH).

HPLC: retention time 3.42 min [Method A].

Experiment 16 
Preparation and Analysis of 116082
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-fluorophenyl)carboxamide
[Protocol A] - 4-fluorophenyl isocyanate
colourless solid (EtOAc/hexane)
MS (ES+): m/z (%) = 391 (M+H 81Br, 96), 389 (M+H 79Br, 100).
1H-NMR (MeOH d4): δ = 3.81 (3H, s, CH3), 6.93-7.11 (3H, m, ArH), 7.37-7.61 (6H,
m, ArH).

HPLC: retention time 3.11 min.

Experiment 17 
Preparation and Analysis of 116087
{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]am 
[Protocol A] - 2-(trifluoromethoxy)phenyl isocyanate 
colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 457 (M+H 81Br, 100), 455 (M+H 79Br, 95).

1H-NMR (DMSO d6): δ = 3.79 (3H, s, CH3), 7.06-7.18 (2H, m, ArH), 7.38-7.49
(2H, m, ArH), 7.51-7.62 (2H, m, ArH), 7.65 (1H, m, ArH), 7.71 (1H, s, ArH), 8.24 (1H, dd,
J=1.1, 8.2, ArH), 8.56 (1H, s, NH), 9.49 (1H, s, NH).

HPLC: retention time 3.40 min.

Experiment 18
Preparation and Analysis of 116089
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-nitrophenyl)carboxamide
[Protocol A] - 2-nitrophenyl isocyanate
yellow solid (EtOAc/hexane)

MS (ES+): m/z (%) = 418 (M+H 81Br, 98), 416 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 3.79 (3H, s, NCH3), 7.14 (1H,
m, ArH), 7.24 (1H, m, ArH), 7.50 (1H, t, J=7.7, ArH), 7.60 (2H, m, ArH), 7.67 (1H, s,
ArH), 7.71 (1H, s, ArH), 8.10 (1H, m, ArH), 8.29 (1H, m, ArH), 9.65 (1H, s, NH), 10.09
(1H, s, NH).

HPLC: retention time 3.10 min [Method A].

Experiment 19 
Preparation and Analysis of 116091
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-methoxyphenyl)carboxamide
[Protocol A] - 4-methoxyphenyl isocyanate
colourless solid (EtOAc/hexane)
MS (ES+): m/z (%) = 403 (M+H 81Br, 100), 401 (M+H 79Br, 96).

1H-NMR (DMSO d6): δ = 3.71 (3H, s, OCH3), 3.79 (3H, s, NCH3), 6.87 (2H, d,
J=8.9, ArH), 7.06 (1H, d, J=7.5, ArH), 7.39 (2H, d, J=8.9, ArH), 7.45-7.61 (3H, m, ArH),
7.65 (1H, s, ArH), 8.52 (1H, s, NH), 8.84 (1H, s, NH).

HPLC: retention time 3.08 min.

Experiment 20 
Preparation and Analysis of 116092
{[3-(4-bromo-l-methylpyrazol-3^ 
[Protocol A] - o-tolyl isocyanate 

colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 387 (M+H 81Br, 94), 385 (M+H 79Br, 100).

1H-NMR (MeOH d4): δ = 2.29 (3H, s, CH3), 3.81 (3H, s, NCH3), 7.03 (1H, dt,
J=1.1, 7.5, ArH), 7.09 (1H, dt, J=1.1, 7.5, ArH), 7.13-7.22 (2H, m, ArH), 7.45 (1H, t, J=7.9,
ArH), 7.49-7.57 (2H, m, ArH), 7.60-7.68 (2H, m, ArH).

HPLC: retention time 2.96 min.

Experiment 21 
Preparation and Analysis of 116097
{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]am 
[Protocol A] - 4-(trifluoromethyl)phenyl isocyanate 
colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 441 (M+H 81Br, 94), 439 (M+H 79Br, 100).
1H-NMR (MeOH d4): δ = 3.82 (3H, s, CH3), 7.04-7.16 (3H, m, ArH), 7.20-7.47 (6H,
m, ArH).

HPLC: retention time 3.56 min.

Experiment 22 
Preparation and Analysis of 116105
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3-chlorophenyl)carboxamide
[Protocol A] - 3-chlorophenyl isocyanate
colourless solid (EtOAc/hexane)
MS (ES+): m/z (%) = 409 (M+H 81Br 37Cl, 26), 407 (M+H 79Br 37Cl (81Br 35Cl),
100), 405 (M+H 79Br 35Cl, 70).

1H-NMR (MeOH d4): δ = 3.81 (3H, s, NCH3), 7.04 (1H, m, ArH), 7.10 (1H, m,
ArH), 7.28 (2H, m, ArH), 7.47 (1H, t, J=7.8, ArH), 7.55 (1H, m, ArH), 7.63 (1H, m, ArH),
7.68 (1H, s, ArH), 7.73 (1H, m, ArH), 9.04 (2H, s, NH).

HPLC: retention time 3.20 min [Method A].

Experiment 23 
Preparation and Analysis of 116108
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-chlorophenyl)carboxamide
[Protocol A] - 2-chlorophenyl isocyanate
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 409 (M+H 81Br 37Cl, 24), 407 (M+H 79Br 37Cl (81Br 35Cl),
100), 405 (M+H 79Br 35Cl, 72).

1H-NMR (MeOH d4): δ = 3.81 (3H, s, NCH3), 7.03 (1H, m, ArH), 7.11 (1H, m,
ArH), 7.28 (1H, m, ArH), 7.35-7.53 (3H, m, ArH), 7.55 (1H, s, ArH), 7.62 (1H, m, ArH),
8.11 (1H, m, ArH).

HPLC: retention time 3.13 min.

Experiment 24 
Preparation and Analysis of 116110
{[3-(4-bromo-1-methylpyrazol-3^
[Protocol A] - 4-isopropylphenyl isocyanate
colourless solid (THF/hexane)

MS (ES+): m/z (%) = 415 (M+H 81Br, 100), 413 (M+H 79Br, 92).

1H-NMR (MeOH d4): δ = 1.23 (6H, d, J=6.8, 2xCH3), 2.86 (1H, septet, J=6.8, CH),
3.82 (3H, s, NCH3), 7.09 (1H, m, ArH), 7.16 (2H, d, J=7.6, ArH), 7.31 (2H, d, J=7.6, ArH),
7.42-7.51 (2H, m, ArH), 7.54 (1H, s, ArH), 7.59 (1H, m, ArH).

HPLC: retention time 3.66 min. 



Experiment 25 
Preparation and Analysis of 116111
{[3-(4-bromo-1-methylpyrazol-3
[Protocol A] - 3-methoxyphenyl isocyanate
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 403 (M+H 81Br, 100), 401 (M+H 79Br, 96).

1H-NMR (MeOH d4): δ = 3.73 (3H, s, OCH3), 3.81 (3H, s, NCH3), 6.59 (1H, m,
ArH), 6.91 (1H, m, ArH), 7.08 (1H, m, ArH), 7.14 (2H, m, ArH), 7.39-7.61 (4H, m, ArH).

HPLC: retention time 2.90 min.

Experiment 26 
Preparation and Analysis of 116112
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3-methylphenyl)carboxamide
[Protocol A] - m-tolyl isocyanate

colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 387 (M+H 81Br, 100), 385 (M+H 79Br, 96).

1H-NMR (DMSO d6): δ = 2.26 (3H, s, CH3), 3.76 (3H, s, NCH3), 6.79 (1H, m,
ArH), 7.06-7.22 (3H, m, ArH), 7.29 (1H, m, ArH), 7.43-7.62 (3H, m, ArH), 7.68 (1H, s,
ArH), 8.65 (1H, s, NH), 8.89 (1H, s, NH).

HPLC: retention time 3.05 min [Method A]. 
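
As a quick consistency check on the MS data, the nominal M+H values can be reproduced from a molecular formula. The sketch below assumes the formula C18H17BrN4O, which is derived here from the compound name given in Experiment 26 rather than stated in the original text; the atomic masses are approximate standard monoisotopic values, and `mh_mz` is a hypothetical helper.

```python
# Approximate monoisotopic masses in unified atomic mass units (assumed values).
MASS = {"C": 12.0, "H": 1.007825, "N": 14.003074, "O": 15.994915,
        "79Br": 78.918338, "81Br": 80.916291}
PROTON = 1.007276  # mass of the added proton in the M+H ion


def mh_mz(formula):
    """Return m/z of the singly charged M+H ion for an element-count dict."""
    neutral = sum(MASS[element] * count for element, count in formula.items())
    return neutral + PROTON


# C18H17BrN4O, derived from the name of compound 116112 (an assumption).
light = mh_mz({"C": 18, "H": 17, "N": 4, "O": 1, "79Br": 1})
heavy = mh_mz({"C": 18, "H": 17, "N": 4, "O": 1, "81Br": 1})

if __name__ == "__main__":
    print(round(light, 3), round(heavy, 3))
```

Rounded to nominal mass, the two values correspond to the 385 and 387 peaks listed for this compound.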

Experiment 27 
Preparation and Analysis of 116113
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-methyl-N-[4-(trifluoromethoxy)phenyl]carboxamide
[Protocol B] - N-methyl-4-(trifluoromethoxy)aniline
pale yellow solid (EtOAc/hexane)

MS (ES+): m/z (%) = 471 (M+H 81Br, 88), 469 (M+H 79Br, 100).

1H-NMR (MeOH d4): δ = 3.35 (3H, s, NCH3), 3.81 (3H, s, NCH3), 7.09 (1H, m,
ArH), 7.25-7.51 (8H, m, ArH).

HPLC: retention time 3.56 min [Method A]. 

Experiment 28 
Preparation and Analysis of 116119
N-[4-(tert-butyl)phenyl]{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carboxamide
[Protocol B] - 4-tert-butylaniline
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 429 (M+H 81Br, 98), 427 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 1.27 (9H, s, 3xCH3), 3.79 (3H, s, NCH3), 7.07 (1H, d,
J=7.5, ArH), 7.29 (2H, d, J=8.7, ArH), 7.37 (2H, d, J=8.7, ArH), 7.45 (1H, t, J=7.5, ArH),
7.51-7.60 (2H, m, ArH), 7.66 (1H, s, ArH), 8.65 (1H, s, NH), 8.83 (1H, s, NH).

HPLC: retention time 3.77 min. 

Experiment 29 
Preparation and Analysis of 116122
N-[4-(dimethylamino)phenyl] { ^
[Protocol B] - N,N-dimethyl-p-phenylenediamine
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 416 (M+H 81Br, 96), 414 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 2.86 (6H, s, NCH3), 3.80 (3H, s, NCH3), 6.80 (2H, m,
ArH), 7.09 (1H, d, J=7.7, ArH), 7.28 (2H, m, ArH), 7.42 (1H, t, J=7.8, ArH), 7.52 (1H, m,
ArH), 7.59 (1H, s, ArH), 7.67 (1H, s, ArH), 8.45 (1H, s, NH), 8.75 (1H, s, NH).

HPLC: retention time 2.07 min [Method A]. 

Experiment 30 
Preparation and Analysis of 116138
N-(3,5-dichloro-4-methylphenyl){P
[Protocol B] - 3,5-dichloro-4-methylphenylamine
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 457 (M+H, 35), 455 (M+H, 100), 453 (M+H, 65).

1H-NMR (DMSO d6): δ = 2.32 (3H, s, CH3), 3.79 (3H, s, NCH3), 7.11 (1H, d, J=7.4,
ArH), 7.46 (1H, t, J=7.8, ArH), 7.50-7.64 (4H, m, ArH), 7.68 (1H, s, ArH), 9.02 (1H, s,
NH), 9.09 (1H, s, NH).

HPLC: retention time 3.66 min. 

Experiment 31
Preparation and Analysis of 116139
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-[4-(trifluoromethylthi
[Protocol B] - 4-(trifluoromethylthio)aniline
colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 473 (M+H 81Br, 100), 471 (M+H 79Br, 94).

1H-NMR (DMSO d6): δ = 3.81 (3H, s, NCH3), 7.11 (1H, d, J=7.5, ArH), 7.47 (1H, t,
J=7.9, ArH), 7.51-7.63 (6H, m, ArH), 7.66 (1H, s, ArH), 9.03 (1H, s, NH), 9.16 (1H, s,
NH).

HPLC: retention time 3.76 min. 

Experiment 32 
Preparation and Analysis of 116141
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(cyclohexyl)carboxamide
[Protocol B] - cyclohexylamine

colourless solid, m.p. 155.5-156.3°C (EtOAc/hexane).

MS (ES+): m/z (%) = 379 (M+H 81Br, 93), 377 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 1.07-1.34 (5H, m, 5xCH), 1.52 (1H, m, CH), 1.63 (2H, m,
2xCH), 1.76 (2H, m, 2xCH), 3.48 (1H, m, NCH), 3.74 (3H, s, CH3), 6.15 (1H, d, J=7.8,
ArH), 6.98 (1H, d, J=7.5, ArH), 7.32-7.43 (2H, m, ArH), 7.51 (1H, m, NH), 7.62 (1H, s,
ArH), 8.50 (1H, s, NH).

HPLC: retention time 3.16 min [Method A]. 
TLC: retention factor 0.35 (50% EtOAc/hexane). 


Experiment 33 
Preparation and Analysis of 116143
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(phenylmethyl)carboxamide
[Protocol B] - benzylamine
colourless solid, m.p. 144.5-146.2°C (EtOAc/hexane).

IR: νmax = 1622, 1565, 1467, 1374, 1239, 973, 802, 752, 695 cm-1.

MS (ES+): m/z (%) = 387 (M+H 81Br, 89), 385 (M+H 79Br, 100).

1H-NMR (CD3OD): δ = 3.81 (3H, s, CH3), 4.40 (2H, s, CH2), 7.05 (1H, m, ArH),
7.19-7.51 (9H, m, ArH).

HPLC: retention time 3.06 min [Method A].

Experiment 34
Preparation and Analysis of 116144
{[3-(4-bromo-1-methylpyrazol-3-yl)phe^
[Protocol A] - 2-fluorophenyl isocyanate
colourless solid (DCM/hexane)

MS (ES+): m/z (%) = 391 (M+H 81Br, 100), 389 (M+H 79Br, 90).

1H-NMR (MeOH d4): δ = 3.79 (3H, s, NCH3), 7.00-7.11 (4H, m, ArH), 7.40-7.56
(3H, m, ArH), 7.61 (1H, m, ArH), 8.09 (1H, m, ArH).

HPLC: retention time 3.01 min. 

Experiment 35 
Preparation and Analysis of 116145
2-({[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}carbonylamino)benzamide
[Protocol B] - 2-aminobenzamide

colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 399 (M+H-17 81Br, 100), 397 (M+H-17 79Br, 94).

1H-NMR (DMSO d6): δ = 3.79 (3H, s, NCH3), 6.93-7.10 (2H, m, ArH), 7.45 (2H, t,
J=7.8, ArH), 7.59-7.72 (5H, m, ArH), 8.22 (2H, m), 9.92 (1H, s, NH), 10.69 (1H, s, NH).

HPLC: retention time 2.88 min. 

Experiment 36 
Preparation and Analysis of 116147
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-cyanophenyl)carboxamide
[Protocol B] - 4-aminobenzonitrile

colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 398 (M+H 81Br, 100), 396 (M+H 79Br, 96).

1H-NMR (MeOH d4): δ = 3.81 (3H, s, NCH3), 7.12 (1H, m, ArH), 7.46-7.57 (3H, m,
ArH), 7.62-7.69 (5H, m, ArH).

HPLC: retention time 3.12 min. 



Experiment 37 
Preparation and Analysis of AR116148
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-cyanophenyl)carboxamide
[Protocol B] - 2-aminobenzonitrile

colourless solid (EtOAc/hexane)

MS (ES+): m/z (%) = 398 (M+H 81Br, 95), 396 (M+H 79Br, 100).

1H-NMR (CDCl3): δ = 3.79 (3H, s, CH3), 7.13-7.28 (2H, m, ArH), 7.49 (1H, t,
J=7.8, ArH), 7.57 (1H, m, ArH), 7.62 (1H, m, ArH), 7.65-7.71 (2H, m, ArH), 7.78 (1H, m,
ArH), 8.07 (1H, d, J=8.6, ArH), 8.83 (1H, s, NH), 9.62 (1H, s, NH).

HPLC: retention time 3.05 min [Method A]. 



Experiment 38 
Preparation and Analysis of 116182
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-fluorophenylmethyl)carboxamide
[Protocol B] - 4-fluorobenzylamine

colourless solid, m.p. 185.5-186.6°C (EtOAc/hexane).

MS (ES+): m/z (%) = 405 (M+H 81Br, 97), 403 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 3.75 (3H, s, CH3), 4.28 (2H, d, J=6.0, CH2), 6.73 (1H, t,
J=5.9, NH), 7.01 (1H, d, J=7.5, ArH), 7.10-7.18 (2H, m, ArH), 7.27-7.41 (4H, m, ArH),
7.56 (1H, s, ArH), 7.62 (1H, s, ArH), 8.82 (1H, s, NH).

HPLC: retention time 3.10 min [Method A]. 
TLC: retention factor 0.25 (50% EtOAc/hexane). 



Experiment 39 
Preparation and Analysis of 116183
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(3,4-dimethoxyphenylmethyl)carboxamide
[Protocol B] - 3,4-dimethoxybenzylamine

colourless solid, m.p. 174.9-175.5°C (EtOAc/hexane).

MS (CI+): m/z (%) = 447 (M+H 81Br, 100), 445 (M+H 79Br, 92).

1H-NMR (DMSO d6): δ = 3.71 (3H, s, CH3), 3.73 (3H, s, CH3), 3.76 (3H, s, CH3),
4.22 (2H, d, J=5.8, CH2), 6.62 (1H, t, J=5.7, NH), 6.80 (1H, m, ArH), 6.89 (2H, m, ArH),
6.98 (1H, m, ArH), 7.36-7.51 (3H, m, ArH), 7.63 (1H, s, ArH), 8.76 (1H, s, NH).

HPLC: retention time 2.86 min [Method A]. 
TLC: retention factor 0.20 (50% EtOAc/hexane). 

Experiment 40
Preparation and Analysis of 116184
{[3-(4-bromo-1-methylpyrazol-3-yl)
[Protocol B] - 3,4,5-trimethoxybenzylamine
colourless solid (EtOAc/hexane).

MS (CI+): m/z (%) = 477 (M+H 81Br, 100), 475 (M+H 79Br, 95).

1H-NMR (DMSO d6): δ = 3.63 (3H, s, OCH3), 3.75 (9H, s, 3xCH3), 4.21 (1H, d,
J=5.9, CH2), 6.61 (2H, s, ArH), 6.65 (1H, t, J=5.9, NH), 6.99 (1H, m, ArH), 7.40 (1H, t,
J=7.7, ArH), 7.45 (1H, m, ArH), 7.56 (1H, m, ArH), 7.64 (1H, s, ArH), 8.77 (1H, s, NH).

HPLC: retention time 5.91 min [Method B]. 

TLC: retention factor 0.50 (50% EtOAc/hexane). 

Experiment 41
Preparation and Analysis of 116185
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(2-methylphenylmethyl)carboxamide
[Protocol B] - 2-methylbenzylamine

colourless solid (EtOAc/hexane).

MS (CI+): m/z (%) = 401 (M+H 81Br, 96), 399 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 2.28 (3H, s, CH3), 3.76 (3H, s, NCH3), 4.28 (1H, d, J=5.8,
CH2), 6.60 (1H, t, J=5.8, NH), 7.01 (1H, m, ArH), 7.15 (3H, m, ArH), 7.24 (1H, m, ArH),
7.38-7.50 (2H, m, ArH), 7.57 (1H, m, ArH), 7.65 (1H, s, ArH), 8.77 (1H, s, NH).

HPLC: retention time 2.74 min [Method A]. 

TLC: retention factor 0.20 (50% EtOAc/hexane). 

Experiment 42 
Preparation and Analysis of 116189
{[3-(4-bromo-1-methylpyrazol-3-yl)phenyl]amino}-N-(4-methoxyphenylmethyl)carboxamide
[Protocol B] - 4-methoxybenzylamine
colourless solid (EtOAc/hexane).

MS (CI+): m/z (%) = 417 (M+H 81Br, 94), 415 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 3.72 (3H, s, CH3), 3.77 (3H, s, NCH3), 4.22 (1H, d, J=5.9,
CH2), 6.62 (1H, t, J=5.9, NH), 6.90 (2H, d, J=8.8, ArH), 7.00 (1H, m, ArH), 7.23 (2H, d,
J=8.8, ArH), 7.39 (1H, t, J=7.8, ArH), 7.43 (1H, m, ArH), 7.56 (1H, m, ArH), 7.64 (1H, s,
ArH), 8.73 (1H, s, NH).

HPLC: retention time 6.41 min [Method B]. 

TLC: retention factor 0.25 (50% EtOAc/hexane). 

Experiment 43 
Preparation and Analysis of 116194
{[3-(4-bromo-1-methylpyrazo
[Protocol B] - 2-(4-methoxyphenyl)ethylamine
colourless solid (EtOAc/hexane).

MS (ES+): m/z (%) = 431 (M+H 81Br, 95), 429 (M+H 79Br, 100).

1H-NMR (DMSO d6): δ = 2.68 (2H, t, J=7.1, CH2), 3.31 (2H, m, CH2), 3.71 (3H, s,
CH3), 3.77 (3H, s, CH3), 6.16 (1H, t, J=5.8, NH), 6.87 (2H, d, J=8.6, ArH), 6.99 (1H, dt,
J=1.4, 7.3, ArH), 7.16 (2H, d, J=8.6, ArH), 7.33-7.48 (2H, m, ArH), 7.52 (1H, m, ArH),
7.63 (1H, s, ArH), 8.71 (1H, s, NH).

HPLC: retention time 6.62 min [Method B].

An important point that can be derived from the foregoing data is that by using a 
constitutively activated form of the receptor in the direct identification of candidate 
compounds, the selectivity of the compounds is exceptional: as those in the art appreciate, the 
homology between the human 5HT2A and 5HT2C receptors is about 95%, and even with such 
homology, certain of the directly identified compounds evidence a 4-order-of-magnitude 
(10,000-fold) selectivity separation (116100). This is important for pharmaceutical 
compositions in that such selectivity can help to reduce side-effects associated with interaction 
of a drug with a non-target receptor. 
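
The relationship between orders of magnitude and fold-selectivity mentioned above is simple arithmetic. The following sketch is illustrative only; the potency values passed in are made-up placeholders rather than data from this disclosure, and the function names are hypothetical.

```python
import math


def fold_selectivity(target_potency, off_target_potency):
    """Fold-selectivity: ratio of off-target to target potency (same units)."""
    if target_potency <= 0 or off_target_potency <= 0:
        raise ValueError("potency values must be positive")
    return off_target_potency / target_potency


def orders_of_magnitude(fold):
    """Express a fold-selectivity as a base-10 logarithm."""
    return math.log10(fold)


if __name__ == "__main__":
    # Placeholder values: 1 nM at the target receptor, 10,000 nM off target.
    fold = fold_selectivity(1.0, 10_000.0)
    print(fold, orders_of_magnitude(fold))
```

With these placeholder inputs the ratio is 10,000, that is, four orders of magnitude.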

Different embodiments of the invention will consist of different constitutively activated 
receptors, different expression systems, different assays, and different compounds. Those 
skilled in the art will understand which receptors to use with which expression systems and 
assay methods. All are considered within the scope of the teaching of this invention. In 
addition, those skilled in the art will recognize that various modifications, additions, 
substitutions, and variations to the illustrative examples set forth herein can be made without 
departing from the spirit of the invention and are, therefore, considered within the scope of the 
invention. 

CLAIMS

We claim: 

1. A cDNA encoding a constitutively active, non-endogenous version, of a human
5HT2C serotonin receptor comprising SEQ ID NO. 28.

2. A constitutively active non-endogenous human 5HT2C serotonin receptor encoded
by the cDNA of SEQ ID NO. 28 comprising SEQ ID NO. 29.

3. A cDNA encoding a constitutively active, non-endogenous version, of a human
5HT2A serotonin receptor comprising SEQ ID NO. 30.

4. A constitutively active non-endogenous human 5HT2A serotonin receptor encoded
by the cDNA of SEQ ID NO. 30 comprising SEQ ID NO. 31.

5. A cDNA encoding a constitutively active, non-endogenous version, of a human
5HT2A serotonin receptor comprising SEQ ID NO. 32.

6. A constitutively active non-endogenous human 5HT2A serotonin receptor encoded
by the cDNA of SEQ ID NO. 32 comprising SEQ ID NO. 33.

7. A method for identifying whether a candidate compound is an inverse agonist to a
non-endogenous human 5HT2 serotonin receptor comprising the steps of:

a. contacting the candidate compound with a non-endogenous human 5HT2
serotonin receptor; and

b. determining, by measurement of a second messenger response, whether said
compound is an inverse agonist.

8. The method of claim 7 in which the non-endogenous human 5HT2 serotonin
receptor comprises SEQ ID NO. 29.

9. The method of claim 7 in which the non-endogenous human 5HT2 serotonin
receptor comprises SEQ ID NO. 31.

10. The method of claim 7 in which the non-endogenous human 5HT2 serotonin
receptor comprises SEQ ID NO. 33.

11. An inverse agonist identified by the method of claim 7. 

12. A reagent for screening compounds to determine whether the compounds are inverse
agonists at human 5HT2 serotonin receptors comprising a membrane fraction from
mammalian cells transfected with and expressing a cDNA encoding for a
constitutively active, non-endogenous version, of a human 5HT2 serotonin receptor
in which the constitutively active non-endogenous human 5HT2 receptor is
expressed on the cell surface.

13. A reagent for screening compounds to determine whether the compounds are inverse
agonists at human 5HT2 serotonin receptors comprising mammalian cells which
produce a second messenger response, transfected with and expressing a cDNA
encoding for a constitutively active, non-endogenous version, of a human 5HT2
serotonin receptor in which the constitutively active non-endogenous human 5HT2
receptor is expressed on the cell surface.

14. A method for modulating by inverse agonism the activity of a human 5HT2A
serotonin receptor by contacting the receptor with a compound of formula:




(A) 



Wherein: 



W is lower alkyl (C1-6), or halogen;

V is lower alkyl (C1-6), or halogen;

X is either Oxygen or Sulfur;

Y is NR2R3, or (CH2)mR4, or O(CH2)nR4;

Z is lower alkyl (C1-6);

m = 0-4

n = 0-4

R1 is H or lower alkyl (C1-4);

R2 is H or lower alkyl (C1-4);

R3 and R4 are independently a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or
aryl group and each said group may be optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe,
COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6
cycloalkyl, C1-6 alkyl, aryl, and aryloxy wherein each of the C3-6 cycloalkyl, C1-6
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7,
SO2NR5R6, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens,
C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle; 

15. A method for modulating by inverse agonism the activity of a human 5HT2A
serotonin receptor by contacting the receptor with a compound of formula:




(B) 

Wherein: 

W is Me, or Et, or halogen;

X is either Oxygen or Sulfur;

Y is NR2R3, or (CH2)mR4, or O(CH2)nR4;

Z is lower alkyl (C1-6);

m = 0-4

n = 0-4

R1 is H or lower alkyl (C1-4);

R2 is H or lower alkyl (C1-4);

R3 and R4 are independently a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or
aryl group and each said group may be optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe,
COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6
cycloalkyl, C1-6 alkyl, aryl, and aryloxy wherein each of the C3-6 cycloalkyl, C1-6
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7,
SO2NR5R6, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens,
C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 

16. A method for modulating by inverse agonism the activity of a human 5HT2A
serotonin receptor by contacting the receptor with a compound of formula:




Wherein: 

Preferably R1 and R2 are H.
Preferably W is Br.
Preferably X is O.
Preferably Z is Me.

R3 is C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or aryl group and each said group
may be optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, COEt, CO-lower alkyl,
SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, aryl, and
aryloxy wherein each of the C3-6 cycloalkyl, C1-6 alkyl, aryl, or aryloxy groups may
be further optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, SO2NR5R6, COMe, COEt,
CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl,
C1-6 alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 

17. A method for modulating by inverse agonism the activity of a human 5HT2A 
serotonin receptor by contacting the receptor with a compound of formula: 
R1




Wherein: 

Preferably R1 is H.
Preferably W is Br.
Preferably X is O.
Preferably Z is Me.
n = 0-4

R4 is C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or aryl group and each said
group may be optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, COEt, CO-lower alkyl,
SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, aryl, and
aryloxy wherein each of the C3-6 cycloalkyl, C1-6 alkyl, aryl, or aryloxy groups may
be further optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, SO2NR5R6, COMe, COEt,
CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl,
C1-6 alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic
non-heterocyclic ring or a polycycle.

18. A method for modulating by inverse agonism the activity of a human 5HT2A 
serotonin receptor by contacting the receptor with a compound of formula: 

Wherein:

Preferably W is Br.
Preferably X is O.
Preferably Z is Me.
Preferably R1 is H.
m = 0-4

R4 is C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or aryl group and each said
group may be optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, COEt, CO-lower alkyl,
SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, aryl, and
aryloxy wherein each of the C3-6 cycloalkyl, C1-6 alkyl, aryl, or aryloxy groups may
be further optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, SO2NR5R6, COMe, COEt,
CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl,
C1-6 alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 
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19. A compound of formula (C) 

R 1 




(C) 
Wherein:

W is Me, or Et, or halogen;

X is either Oxygen or Sulfur;

Y is NR2R3, or (CH2)mR4, or O(CH2)nR4;

Z is lower alkyl (C1-6);

m = 0-4;

n = 0-4;

R1 is H or lower alkyl (C1-4);

R2 is H or lower alkyl (C1-4);

R3 is a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or (CH2)k aryl group (k = 1-4),
preferably k = 1, and each said group may be optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe,
COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6
cycloalkyl, C1-6 alkyl, aryl, and aryloxy wherein each of the C3-6 cycloalkyl, C1-6
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four
substituents in any position independently selected from CF3, CCl3, Me, NO2, OH,
OMe, OEt, CONR5R6, NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7,
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S0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, H, halogens, C,_ 
4 alkoxy, C 3 . 6 cycloalkyl, Ci. 6 alkyl, and aryl; 

R4 is a C1-6 alkyl, or C2-6 alkenyl, or cycloalkyl, or aryl group and each said
group may be optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, OCF3, SMe, COOR7, SO2NR5R6, SO3R7, COMe, COEt, CO-lower alkyl,
SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, aryl, and
aryloxy wherein each of the C3-6 cycloalkyl, C1-6 alkyl, aryl, or aryloxy groups may
be further optionally substituted by up to four substituents in any position
independently selected from CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR5R6,
NR5R6, NHCOCH3, OCF3, SMe, COOR7, SO3R7, SO2NR5R6, COMe, COEt, CO-
lower alkyl, SCF3, CN, C2-6 alkenyl, H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6
alkyl, and aryl;

R5 and R6 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2 aryl group and each said group may be optionally
substituted by up to four substituents in any position independently selected from
CF3, CCl3, Me, NO2, OH, OMe, OEt, CONR7R8, NR7R8, NHCOCH3, OCF3, SMe,
COOR9, SO3R7, SO2NR7R8, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl,
H, halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl wherein each of the
C3-6 cycloalkyl, C1-6 alkyl, or aryl groups may be further optionally substituted by
up to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, CONR8R9, NR8R9, NHCOCH3, OCF3, SMe, COOR7,
SO2NR8R9, SO3R7, COMe, COEt, CO-lower alkyl, SCF3, CN, C2-6 alkenyl, H,
halogens, C1-4 alkoxy, C3-6 cycloalkyl, C1-6 alkyl, and aryl,

or R5 and R6 may form part of a 5, 6 or 7 membered cyclic structure which
may be either saturated or unsaturated and that may contain up to four heteroatoms
selected from O, N or S and said cyclic structure may be optionally substituted by up
to four substituents in any position independently selected from CF3, CCl3, Me,
NO2, OH, OMe, OEt, OCF3, SMe, COOR7, SO2NR8R9, SO3R7, NHCOCH3, COEt,
COMe, or halogen;

R7 may be independently selected from H or C1-6 alkyl;

R8 and R9 are independently a H, or C1-6 alkyl, or C2-6 alkenyl, or
cycloalkyl, or aryl, or CH2 aryl group and each said group may be optionally
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substituted by up to four substituents in any position independently selected from 
halogen, CF3, OCF3, OEt, CCl3, Me, NO2, OH, OMe, SMe, COMe, CN, COOR7,
SO3R7, COEt, NHCOCH3, or aryl;

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non-
heterocyclic ring or a polycycle;

with the proviso that said compound is not:
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][methylamino]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl}amino]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][2-chlorophenyl]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][2-chloro-3-pyridyl]carboxamide, or
N-[3-(4-bromo-1-methylpyrazol-3-yl)phenyl][trichloromethyl]carboxamide.
20. The use of a compound of claim 19 for the manufacture of a medicament. 
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FIGURE 3A 

ATGGATATTCTTTGTGAAGAAAATACTTCTTTGAGCTCAACTACGAACTCCCTAATGCAATTA 

AATGATGACAACAGGCTCTACAGTAATGACTTTAACTCCGGAGAAGCTAACACTTCTGATGCA 

TTTAACTGGACAGTCGACTCTGAAAATCGAACCAACCTTTCCTGTGAAGGGTGCCTCTCACCG 

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCTTTACTGACAGCCGTAGTGA

TTATTCTAACTATTGCTGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC

AGAATGCCACCAACTATTTCCTGATGTCACTTGCCATAGCTGATATGCTGCTGGGTTTCCTTGT

CATGCCCGTGTCCATGTTAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCTTTGT

GCAGTCTGGATTTACCTGGACGTGCTCTTCTCCACGGCCTCCATCATGCACCTCTGCGCCATCT

CGCTGGACCGCTACGTCGCCATCCAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA

AGGCATTTCTGAAAATCATTGCTGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG

TCTTTGGGCTACAGGACGATTCGAAGGTCTTTAAGGAGGGGAGTTGCTTACTCGCCGATGATA

ACTTTGTCCTGATCGGCTCTTTTGTGTCATTTTTCATTCCCTTAACCATCATGGTGATCACCTAC

TTTCTAACTATCAAGTCACTCCAGAAAGAAGCTACTTTGTGTGTAAGTGATCTTGGCACACGG

GCCAAATTAGCTTCTTTCAGCTTCCTCCCTCAGAGTTCTTTGTCTTCAGAAAAGCTCTTCCAGC

GGTCGATCCATAGGGAGCCAGGGTCCTACACAGGCAGGAGGACTATGCAGTCCATCAGCAAT

GAGCAAAAGGCATGCAAGGTGCTGGGCATCGTCTTCTTCCTGTTTGTGGTGATGTGGTGCCCT

TTCTTCATCACAAACATCATGGCCGTCATCTGCAAAGAGTCCTGCAATGAGGATGTCATTGGG

GCCCTGCTCAATGTGTTTGTTTGGATCGGTTATCTCTCTTCAGCAGTCAACCCACTAGTCTACA

CACTGTTCAACAAGACCTATAGGTCAGCCTTTTCACGGTATATTCAGTGTCAGTACAAGGAAA
ACAAAAAACCATTGCAGTTAATTTTAGTGAACACAATACCGGCTTTGGCCTACAAGTCTAGCC
AACTTCAAATGGGACAAAAAAAGAATTCAAAGCAAGATGCCAAGACAACAGATAATGACTGC
TCAATGGTTGCTCTAGGAAAGCAGTATTCTGAAGAGGCTTCTAAAGACAATAGCGACGGAGT

GAATGAAAAGGTGAGCTGTGTGTGA 
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FIGURE 3B 

MDILCEE>rrSLSSTTNSLMQL>roDEfRLYS>roFNSGE^ 

SLLHLQEKNWSALLTAVVIILTIAGNILVTMAVSL^ 

LmYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCA^ 

TISVGISMPIPWGLQDDSKVFKEGSCLIADDNFVU^ 

DLGTFJUCLASFSFLPQSSLSSEKLFQRSIHREPGSYTGPJt™^ 

PFFTIMMAVICKESCNEDVIGAI^^ 
PLQULVNTTPAIAYKSSQLQMGQKXNSKQDAKTTO^ 

SCV 

FIGURE 4B 

MVNLRNAVHS FL VHLIGLL VWQCD I S VS PVAAI VTD I FNTSDGGRFKFPDGVQNWP ALS I VI 1 1 IMTI GGN 

ILVlMAVSMEKKLffllATinfFLMSIAXADMLVGLLVMPLSIiLAILTO SLDVLFSTAS I 

MHLCAISIJJRYVAIRNPIEHSRFNSRTKAIMKI^^ 

FVLIGSFVAFFIPLTIMVITYCLTIYVLRRQA^ 

NARRFJCKKERRPRGTMQAI1INERKASKVLG 

IGYVCSGINPLVYTLFNKIYRRAFSNYLROmCVEKKPPWQIPRVAA 
ASDNEPGIEMQVENLELPVNPSSWSERISSV 
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FIGURE 4A 



ATGGTGAACCTGAGGAATGCGGTGCATTCATTCCTTGTGCACCTAATTGGCCTATTGGTTTGGC

AATGTGATATTTCTGTGAGCCCAGTAGCAGCTATAGTAACTGACATTTTCAATACCTCCGATG

GTGGACGCTTCAAATTCCCAGACGGGGTACAAAACTGGCCAGCACTTTCAATCGTCATCATAA

TAATCATGACAATAGGTGGCAACATCCTTGTGATCATGGCAGTAAGCATGGAAAAGAAACTG 

CACAATGCCACCAATTACTTCTTAATGTCCCTAGCCATTGCTGATATGCTAGTGGGACTACTTG 

TCATGCCCCTGTCTCTCCTGGCAATCCTTTATGATTATGTCTGGCCACTACCTAGATATTTGTG

CCCCGTCTGGATTTCTTTAGATGTTTTATTTTCAACAGCGTCCATCATGCACCTCTGCGCTATAT

CGCTGGATCGGTATGTAGCAATACGTAATCCTATTGAGCATAGCCGTTTCAATTCGCGGACTA 

AGGCCATCATGAAGATTGCTATTGTTTGGGCAATTTCTATAGGTGTATCAGTTCCTATCCCTGT 

GATTGGACTGAGGGACGAAGAAAAGGTGTTCGTGAACAACACGACGTGCGTGCTCAACGACC 

CAAATTTCGTTCTTATTGGGTCCTTCGTAGCTTTCTTCATACCGCTGACGATTATGGTGATTAC

GTATTGCCTGACCATCTACGTTCTGCGCCGACAAGCTTTGATGTTACTGCACGGCCACACCGA 

GGAACCGCCTGGACTAAGTCTGGATTTCCTGAAGTGCTGCAAGAGGAATACGGCCGAGGAAG

AGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAAGAAAGAAGAAGGAGAGACG 

TCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTTCGAAAGTCCTTGGGATTG

TTTTCTTTGTGTTTCTGATCATGTGGTGCCCATTTTTCATTACCAATATTCTGTCTGTTCTTTGTG

AGAAGTCCTGTAACCAAAAGCTCATGGAAAAGCTTCTGAATGTGTTTGTTTGGATTGGCTATG 

TTTGTTCAGGAATCAATCCTCTGGTGTATACTCTGTTCAACAAAATTTACCGAAGGGCATTCTC 

CAACTATTTGCGTTGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAGGCAGATTCCAAGAGT 

TGCCGCCACTGCTTTGTCTGGGAGGGAGCTTAATGTTAACATTTATCGGCATACCAATGAACC

GGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAGATGCAAGTTGAGAATTTAGAGT 

TACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGATTAGCAGTGTGTGA 
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FIGURE 5A 

ATGGTGAACCTGAGGAATGCGGTGCATTCATTCCTTGTGCACCTAATTGGCCTATTGGTTTGGCAAT 

GTGATATTTCTGTGAGCCCAGTAGCAGCTATAGTAACTGACATTTTCAATACCTCCGATGGTGGACG 

CTTCAAATTCCCAGACGGGGTACAAAACTGGCCAGCACTTTCAATCGTCATCATAATAATCATGAC

AATAGGTGGCAACATCCTTGTGATCATGGCAGTAAGCATGGAAAAGAAACTGCACAATGCCACCA 

ATTACTTCTTAATGTCCCTAGCCATTGCTGATATGCTAGTGGGACTACTTGTCATGCCCCTGTCTCTC

CTGGCAATCCTTTATGATTATGTCTGGCCACTACCTAGATATTTGTGCCCCGTCTGGATTTCTTTAGA

TGTTTTATTTTCAACAGCGTCCATCATGCACCTCTGCGCTATATCGCTGGATCGGTATGTAGCAATA 

CGTAATCCTATTGAGCATAGCCGTTTCAATTCGCGGACTAAGGCCATCATGAAGATTGCTATTGTTT 

GGGCAATTTCTATAGGTGTATCAGTTCCTATCCCTGTGATTGGACTGAGGGACGAAGAAAAGGTGT 

TCGTGAACAACACGACGTGCGTGCTCAACGACCCAAATTTCGTTCTTATTGGGTCCTTCGTAGCTTT

CTTCATACCGCTGACGATTATGGTGATTACGTATTGCCTGACCATCTACGTTCTGCGCCGACAAGCT 

TTGATGTTACTGCACGGCCACACCGAGGAACCGCCTGGACTAAGTCTGGATTTCCTGAAGTGCTGC 

AAGAGGAATACGGCCGAGGAAGAGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAA 

GAAAGAAGAAGGAGAGACGTCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAA 

GAAAGTCCTTGGGATTGTTTTCTTTGTGTTTCTGATCATGTGGTGCCCATTTTTCATTACCAATATTC

TGTCTGTTCTTTGTGAGAAGTCCTGTAACCAAAAGCTCATGGAAAAGCTTCTGAATGTGTTTGTTTG 

GATTGGCTATGTTTGTTCAGGAATCAATCCTCTGGTGTATACTCTGTTCAACAAAATTTACCGAAGG

GCATTCTCCAACTATTTGCGTTGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAGGCAGATTCCA

AGAGTTGCCGCCACTGCTTTGTCTGGGAGGGAGCTTAATGTTAACATTTATCGGCATACCAATGAA

CCGGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAGATGCAAGTTGAGAATTTAGAGTT

ACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGATTAGCAGTGTGTGA 
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FIGURE 5B 



NTVNLRNAVHSFLVHLIGLLVWQCDISVSPVA^ 

GGNILVIMAVSMEKKLHNATKm.MSIjUADMLVGLLVMPLSL^ 

DVLFSTASIMHLCAISLDRYVAIRNPIEHSRi^SRTKAINIKIAIWAIS 

NI s nTCVTJ^PNFVLIGSFVAFFIPLTIMVITYCLT^ 

TAEEENSANPNQDQNAFJIRKKKERRPRGTMQAINNERKA^ 

(^KSCNQKIMEKLLNVFVHATIGYVCSGINP 

AATALSGRELNVNIYRHT>mPVIEKASDNEPGIEMQV^ 



MDILCEE>TrSLSSTTNSLMQLNDDr*rRLYS>roFNSGEA>r^^ 

SLIJILQEKNWSALLTAVVIILTIAGNILVIMAVSL 

LmYGYRWPIJSKLCAVWIYIJDVLFSTASIM^ 

TISVGISMPIPWGLQDDSKVFKEGSCLIADDNFVUGSFVSFFTPL^^ 

HGHTEEPPGLSLDFXKCCKR>TrAEEENSA>rPNQDQNAR^ 

KVLGIVFFIJFVVMWCPFFIT>nMAVICKESCNEDV^ 

RAFSITCIJilCT>r¥KVTSKKPPVRQIPRVAATAl^ 

NLELPVNPSSWSERISSV 



FIGURE 6B 



FIGURE 6C 
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FIGURE 6 A 

ATGGATATTCTTTGTGAAGAAAATACTTCTTTGAGCTCAACTACGAACTCCCTAATGCAATTA

AATGATGACAACAGGCTCTACAGTAATGACTTTAACTCCGGAGAAGCTAACACTTCTGATGCA 

TTTAACTGGACAGTCGACTCTGAAAATCGAACCAACCTTTCCTGTGAAGGGTGCCTCTCACCG

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCTTTACTGACAGCCGTAGTGA

TTATTCTAACTATTGCTGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC

AGAATGCCACCAACTATTTCCTGATGTCACTTGCCATAGCTGATATGCTGCTGGGTTTCCTTGT

CATGCCCGTGTCCATGTTAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCTTTGT 

GCAGTCTGGATTTACCTGGACGTGCTCTTCTCCACGGCCTCCATCATGCACCTCTGCGCCATCT 

CGCTGGACCGCTACGTCGCCATCCAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA 

AGGCATTTCTGAAAATCATTGCTGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG 

TCTTTGGGCTACAGGACGATTCGAAGGTCTTTAAGGAGGGGAGTTGCTTACTCGCCGATGATA 

ACTTTGTCCTGATCGGCTCTTTTGTGTCATTTTTCATTCCCTTAACCATCATGGTGATCACCTAC

TTTCTAACTATCAAGGTTCTGCGCCGACAAGCTTTGATGTTACTGCACGGCCACACCGAG 

GAACCGCCTGGACTAAGTCTGGATTTCCTGAAGTGCTGCAAGAGGAATACGGCCGAGGA 

AGAGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAAGAAAGAAGAAGGAG 

AGACGTCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTTCGAAGGTACT 

GGGCATCGTCTTCTTCCTGTTTGTGGTGATGTGGTGCCCTTTCTTCATCACAAACATCATGGCC

GTCATCTGCAAAGAGTCCTGCAATGAGGATGTCATTGGGGCCCTGCTCAATGTGTTTGTTTGG 

ATCGGTTATCTCTCTTCAGCAGTCAACCCACTAGTCTATACTCTGTTCAACAAAATTTACCGA

AGGGCATTCTCCAACTATTTGCGTTGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAG 

GCAGATTCCAAGAGTTGCCGCCACTGCTTTGTCTGGGAGGGAGCTTAATGTTAACATTT

ATCGGCATACCAATGAACCGGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAG 

ATGCAAGTTGAGAATTTAGAGTTACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGAT 

TAGCAGTGTGTGA 
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FIGURE 7A 

ATGGATATTCTTTGTGAAGAAAATACITCTTTGAGCrCAACTACGAACTCCCTAA 

AATGATGACAACAGGCTCTACAGTAATGACTTTAACTCCGGAGAAGCTAACACTTCTGATGCA 

TTTAACTGGACAGTCGACTCTGAAAATCGAACCAACCTTrCCTGTGAAGGGTGCCTCTCACCG 

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCTTTACTGACAGCCGTAGTGA 

TTATTCTAACTATTGCTGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC 

AGAATGCCACCAACTATTTCCTGATGTCACTTGCCATAGCrGATATGCTGCTGGGTTTCCTTGT 

CATGCCCGTGTCCATGTrAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCTTTGT 

GCAGTCTGGATTTACCTGGACGTGCTCTTCTCCACGGCCTCCATCATGCACCTCTGCGCCATCT 

CGCTGGACCGCTACGTCGCCATCCAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA 

AGGCATTTCTGAAAATCATTGCTGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG 

TCTTTGGGCTACAGGACGATTCGAAGGTCTTTAAGGAGGGGAGTTGCTTACTCGCCGATGATA 

ACTrrGTCCTGATCGGCTCTlTTGTGTC^ 

ATTGCCTGArrATf^ArCTTrTC^GCC flArAAGCTTTGATGTTACTGCACGGCCACACC 
GAGGAACCGrfT-f^ACTA A nTrTG^ATTTrrTnAAGTGGTGCAAGAGGAATACGGCCGA 

GGAAGAGAA C TrTnrAAACCrrAACCAAGAGGAGAACGCACGGGgAAgAAAGAAGAAQ 
CAGAGACGTrrTAnGGGC A rrATGCAGGrTATCAACAATGAAAGAAAAGCTAAGAAAGT 

CCTTrGGG ATTKTTTTr*TTTf;TGTTTCTG ATCA TGTGGTGCCC1 TTCTI CATCACAAACATCA 
TGGCCGTCATCTGCAAAGAGTCCTGCAATGAGGATGTCATTGGGGCCCTGCTCAATGTGTTTG 

TTTGGATCGGTTATCTCTCTTCAGCAGTCAACCCACTAGTC 

ArrGAAGGG ^ ATTrT'f'r a act atttgggttgt A ATT ATAAGGTAQAQAAA A AGCCTCQT 
CTCAGGCAGA T rrrAAGAcrrTr^CGCCACT^^ 

rATTTATCGGCATArrAATG A ArrGCTGATrnAGAAAGCCAGTGACAATQAGCCCGGTA 
TAGAGATGCAAfSTTnAGA AT T TArSAfyTTAGCAGTA A ATCCCTCGAGTGTGQTTAGCGAA 
A CidATTXGCA GTGTGTG A 
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FIGURE 7B 

MDILOiENTSLSSTTNSIJrfQLNDDJVRL^ 

SIXHLQEKNWSALI-TAVVIILTIAGNILVIMAVSI^KKLQNATNY 

LTILYGYRWPIJSia.CAVWIYLDVIJSTASIMHLCAI^ 

TISVGISMPIPWGLQDDSKVFKEGSCIJADDNFVIJGSFVSFFIPLTIMVTrYC ^ 
LHGHTEEPPOTST.PFLKCCKRNTAEEENSAT^ 

^TKVLGIVFTVFLIMWg > FFrr>nMAVICKESC^^ WTT FNTOY 

RRAFSNYLRCIVVTCV^KK3»PVTtOIPRVAATALSGI^L 

ENLELPVNP SSWSERISSV 
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FIGURE 15 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Arena Pharmaceuticals, Inc. and Tripos, Inc. 



(ii) TITLE OF INVENTION: Non-Endogenous, Constitutively Activated 

Human Serotonin Receptors and Small Molecule Modulators Thereof 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Woodcock Washburn Kurtz Mackiewicz & 
Norris LLP 

(B) STREET: One Liberty Place - 46th Floor

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19103 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: WINDOWS NT, Version #4.0 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US99/08168

(B) FILING DATE: April 14, 1999 

(C) CLASSIFICATION: 435 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Mark J. Rosen 

(B) REGISTRATION NUMBER: 39,822 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (215) 568-3100 

(B) TELEFAX: (215) 568-3439 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GACCTCGAGG TTGCTTAAGA CTGAAGC 
(3) INFORMATION FOR SEQ ID NO : 2 : 




WO 99/52927 



PCT/US99/08168 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
ATTTCTAGAC ATATGTAGCT TGTACCG
(4) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CTAGGGGCAC CATGCAGGCT ATCAACAATG AAAGAAAAGC TAAGAAAGTC 50
(5) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CAAGGACTTT CTTAGCTTTT CTTTCATTGT TGATAGCCTG CATGGTGCCC 50 
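
As an illustrative aside (not part of the original listing), SEQ ID NO:3 and SEQ ID NO:4 as printed above appear to be complementary 50-mers offset from one another by four bases. The sketch below checks this under the assumption that the grouping spaces in the listing carry no meaning; the helper name `reverse_complement` is hypothetical.

```python
# Illustrative check only; helper and variable names are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


# SEQ ID NO:3 and SEQ ID NO:4 as listed, with grouping spaces removed.
SEQ_ID_3 = "CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC"
SEQ_ID_4 = "CAAGGACTTTCTTAGCTTTTCTTTCATTGTTGATAGCCTGCATGGTGCCC"

rc3 = reverse_complement(SEQ_ID_3)

# The two 50-mers are staggered by four bases, so only 46 bases are shared:
# the first 46 bases of the reverse complement of SEQ ID NO:3 line up with
# SEQ ID NO:4 after its first four bases.
overlap_matches = rc3[:46] == SEQ_ID_4[4:]

print(len(SEQ_ID_3), len(SEQ_ID_4), overlap_matches)
```

Both strings have the declared length of 50 bases, and the 46-base overlap matches exactly when the strings are transcribed as shown.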
(6) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GACCTCGAGT CCTTCTACAC CTCATC 26

(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30

(8) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CAAAGAAAGT ACTGGGCATC GTCTTCTTCC T 31 
(9) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCGCTCGAGT ACTGCGCCGA CAAGCTTTGA T 31
(10) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CGATGCCCAG CACTTTCGAA GCTTTTCTTT CATTGTTG 
(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AAAAGCTTCG AAAGTGCTGG GCATCGTCTT CTTCCT 
(12) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TGCTCTAGAT TCCAGATAGG TGAAAACTTG 



(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGTGTCTCTC CTTACTTCA 
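
As an illustrative aside (not part of the original listing), the declared LENGTH fields can be checked against the printed sequences once the grouping spaces are stripped. The sketch below does this for a few of the primers listed above; the names `LISTED`, `normalize` and `check_lengths` are hypothetical.

```python
# Hypothetical helper for illustration; names are not from the original document.
LISTED = {
    # SEQ ID NO: (sequence as printed, declared LENGTH in base pairs)
    1: ("GACCTCGAGG TTGCTTAAGA CTGAAGC", 27),
    5: ("GACCTCGAGT CCTTCTACAC CTCATC", 26),
    6: ("TGCTCTAGAT TCCAGATAGG TGAAAACTTG", 30),
    12: ("CGTGTCTCTC CTTACTTCA", 19),
}


def normalize(raw: str) -> str:
    """Drop grouping spaces and upper-case the sequence."""
    return "".join(raw.split()).upper()


def check_lengths(entries: dict) -> dict:
    """Map each SEQ ID NO to True when the base count matches the declared length."""
    results = {}
    for seq_id, (raw, declared) in entries.items():
        seq = normalize(raw)
        only_bases = set(seq) <= set("ACGT")  # reject stray characters
        results[seq_id] = only_bases and len(seq) == declared
    return results


print(check_lengths(LISTED))
```

A mismatch would suggest a transcription or scanning error in either the sequence or its LENGTH field.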

(14) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCGGCGCAGT ACTTTGATAG TTAGAAAGTA GGTGAT 
(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTCTAACTAT CAAAGTACTG CGCCGACAAG CTTTGATG 
(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTCAGCAGTC AACCCACTAG TCTATACTCT GTTCAACAAA ATT
(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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ATTTCTAGAC ATATGTAGCT TGTACCGT 2 8 

(18) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATCACCTACT TTCTAACTA 19

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCATAATCGT CAGGGGAATG AAAAATGACA CAA 33

(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATTTTTCATT CCCCTGACGA TTATGGTGAT TAC 33

(21) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TGATGAAGAA AGGGCACCAC ATGATCAGAA ACA
(22) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GATCATGTGG TGCCCTTTCT TCATCACAAA CAT
(23) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GAGACATATT ATCTGCCACG GAGG 
(24) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TTGGCATAGA AACCGGACCC AAGG 
(25) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60
TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120
GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180
CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240
ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300
CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360
ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420
TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480
GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540
ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600
ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660
GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720
GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780
CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840
TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900
AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960
GCATGCAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020
ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080
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CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG 

CAGTCAACCC ACTAGTCTAC 1140
ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200
GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260
TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320
AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380
AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416
(26) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn
1 5 10 15

Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe
20 25 30

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp
35 40 45

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser
50 55 60

Cys Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu Thr
65 70 75 80

Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile Met
85 90 95

Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe Leu
100 105 110

Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met Pro
115 120 125

Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro Ser
130 135 140

Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr Ala
145 150 155 160
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Ser lie Met His Leu Cys Ala lie Ser Leu Asp Arg Tyr Val Ala lie 
165 170 175

Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala Phe
180 185 190

Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met Pro
195 200 205

Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu Gly
210 215 220

Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe Val
225 230 235 240

Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu Thr
245 250 255

Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu Gly
260 265 270

Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser Leu
275 280 285

Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly Ser
290 295 300

Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys Ala
305 310 315 320

Cys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp Cys
325 330 335

Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser Cys
340 345 350

Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile Gly
355 360 365

Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn Lys
370 375 380

Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys Glu
385 390 395 400

Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala Leu
405 410 415

Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser Lys
420 425 430

Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu Gly
435 440 445

Lys Gln Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val Asn
450 455 460

Glu Lys Val Ser Cys Val
465 470

(27) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60
TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120
TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180
GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240
GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300
CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360
CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420
TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480
GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540
ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600
TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660
GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720
CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780
CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840
AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900
CAGGCTATCA ACAATGAAAG AAAAGCTTCG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960
CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020
TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080
TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140
AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200
GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260
GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320
TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1377



(28) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 45 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile
1 5 10 15

Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala
20 25 30

Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe
35 40 45

Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile
50 55 60

Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met
65 70 75 80

Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala
85 90 95

Ile Ala Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu
100 105 110

Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro
115 120 125

Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His
130 135 140

Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile
145 150 155 160

Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala
165 170 175

Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile
180 185 190

Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val
195 200 205

Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe
210 215 220

Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val
225 230 235 240

Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro
245 250 255

Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala
260 265 270
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Glu Glu Glu Asn Ser Ala Asn Pro Asn Gin Asp Gin Asn Ala Arg Arg 
275 280 285

Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn
290 295 300

Asn Glu Arg Lys Ala Ser Lys Val Leu Gly Ile Val Phe Phe Val Phe
305 310 315 320

Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu
325 330 335

Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val
340 345 350

Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr
355 360 365

Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg
370 375 380

Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg
385 390 395 400

Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr
405 410 415

Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro
420 425 430

Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser
435 440 445

Ser Val Val Ser Glu Arg Ile Ser Ser Val
450 455
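
As an illustrative aside (not part of the original listing), SEQ ID NO:26 above and SEQ ID NO:28 below have the same declared length (1377 base pairs) and, as printed, differ in the block of bases 901 to 960. The sketch below compares just that block codon by codon, assuming the standard genetic code; the names `CODON_TABLE` and `codon_differences` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Bases 901-960 of SEQ ID NO:26 and SEQ ID NO:28, grouping spaces removed.
BLOCK_26 = "CAGGCTATCAACAATGAAAGAAAAGCTTCGAAAGTCCTTGGGATTGTTTTCTTTGTGTTT"
BLOCK_28 = "CAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTCCTTGGGATTGTTTTCTTTGTGTTT"


def codon_differences(seq_a: str, seq_b: str, first_base: int) -> list:
    """List (codon number, codon_a, codon_b, residue_a, residue_b) for each mismatch.

    first_base is the 1-based position of the first base of both fragments
    in the full sequence; it must be the first base of a codon.
    """
    diffs = []
    for offset in range(0, len(seq_a) - 2, 3):
        a = seq_a[offset:offset + 3]
        b = seq_b[offset:offset + 3]
        if a != b:
            codon_number = (first_base - 1 + offset) // 3 + 1
            diffs.append((codon_number, a, b, CODON_TABLE[a], CODON_TABLE[b]))
    return diffs


print(codon_differences(BLOCK_26, BLOCK_28, first_base=901))
# [(310, 'TCG', 'AAG', 'S', 'K')]
```

Within this block the only difference is at codon 310, where TCG (Ser, matching residue 310 of SEQ ID NO:27) is replaced by AAG (Lys).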

(29) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60 

TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120 

TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180 

GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240 

GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300 

CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360 

CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420 

TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480 

GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540 

ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600 

TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660 

GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720 

CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780 

CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840 

AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900 

CAGGCTATCA ACAATGAAAG AAAAGCTAAG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960 

CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020 

TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080 

TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140 

AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200 

GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260 

GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320 

TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1377 


(30) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile 
1 5 10 15 

Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala 
20 25 30 

Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe 
35 40 45 

Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile 
50 55 60 

Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met 
65 70 75 80 

Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala 
85 90 95 

Ile Ala Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu 
100 105 110 

Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro 
115 120 125 

Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His 
130 135 140 

Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile 
145 150 155 160 

Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala 
165 170 175 

Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile 
180 185 190 

Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val 
195 200 205 

Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe 
210 215 220 

Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val 
225 230 235 240 

Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro 
245 250 255 

Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala 
260 265 270 

Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln Asn Ala Arg Arg 
275 280 285 

Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn 
290 295 300 

Asn Glu Arg Lys Ala Lys Lys Val Leu Gly Ile Val Phe Phe Val Phe 
305 310 315 320 

Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu 
325 330 335 

Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val 
340 345 350 

Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr 
355 360 365 

Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg 
370 375 380 

Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg 
385 390 395 400 

Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr 
405 410 415 

Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro 
420 425 430 

Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser 
435 440 445 

Ser Val Val Ser Glu Arg Ile Ser Ser Val 
450 455 

(31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60 

TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120 

GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180 

CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240 

ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300 

CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360 

ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420 

TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480 

GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540 

ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600 

ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660 

GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720 

GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGGTT 780 

CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 840 

CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 900 

AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 960 

CAGGCTATCA ACAATGAAAG AAAAGCTTCG AAGGTACTGG GCATCGTCTT CTTCCTGTTT 1020 

GTGGTGATGT GGTGCCCTTT CTTCATCACA AACATCATGG CCGTCATCTG CAAAGAGTCC 1080 

TGCAATGAGG ATGTCATTGG GGCCCTGCTC AATGTGTTTG TTTGGATCGG TTATCTCTCT 1140 

TCAGCAGTCA ACCCACTAGT CTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1200 

AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1260 

GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1320 

GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1380 

TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1437 

(32) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn 
1 5 10 15 

Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe 
20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
50 55 60 

Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu 
65 70 75 80 

Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile 
85 90 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe 
100 105 110 

Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met 
115 120 125 

Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 
130 135 140 

Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr 
145 150 155 160 

Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala 
165 170 175 

Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 
180 185 190 

Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met 
195 200 205 

Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe 
225 230 235 240 

Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu 
245 250 255 

Thr Ile Lys Val Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His 
260 265 270 

Thr Glu Glu Pro Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys 
275 280 285 

Arg Asn Thr Ala Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln 
290 295 300 

Asn Ala Arg Arg Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met 
305 310 315 320 

Gln Ala Ile Asn Asn Glu Arg Lys Ala Ser Lys Val Leu Gly Ile Val 
325 330 335 

Phe Phe Leu Phe Val Val Met Trp Cys Pro Phe Phe Ile Thr Asn Ile 
340 345 350 

Met Ala Val Ile Cys Lys Glu Ser Cys Asn Glu Asp Val Ile Gly Ala 
355 360 365 

Leu Leu Asn Val Phe Val Trp Ile Gly Tyr Leu Ser Ser Ala Val Asn 
370 375 380 

Pro Leu Val Tyr Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser 
385 390 395 400 

Asn Tyr Leu Arg Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg 
405 410 415 

Gln Ile Pro Arg Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn 
420 425 430 

Val Asn Ile Tyr Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser 
435 440 445 

Asp Asn Glu Pro Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro 
450 455 460 

Val Asn Pro Ser Ser Val Val Ser Glu Arg Ile Ser Ser Val 
465 470 475 

(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60 

TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120 

GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180 

CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240 

ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300 

CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360 

ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420 

TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480 

GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540 

ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600 

ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660 

GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720 

GTGTCATTTT TCATTCCCCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 780 

CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 840 

CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 900 

AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 960 

[illegible] ACAATGAAAG AAAAGCTAAG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 1020 

CTGATCATGT GGTGCCCTTT CTTCATCACA AACATCATGG CCGTCATCTG CAAAGAGTCC 1080 

TGCAATGAGG ATGTCATTGG GGCCCTGCTC AATGTGTTTG TTTGGATCGG TTATCTCTCT 1140 

TCAGCAGTCA ACCCACTAGT CTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1200 

AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1260 

GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1320 

GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1380 

TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1437 



(34) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn 
1 5 10 15 

Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe 
20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
50 55 60 

Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu 
65 70 75 80 

Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile 
85 90 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe 
100 105 110 

Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met 
115 120 125 

Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 
130 135 140 

Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr 
145 150 155 160 

Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala 
165 170 175 

Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 
180 185 190 

Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met 
195 200 205 

Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe 
225 230 235 240 

Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu 
245 250 255 

Thr Ile Tyr Val Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His 
260 265 270 

Thr Glu Glu Pro Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys 
275 280 285 

Arg Asn Thr Ala Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln 
290 295 300 

Asn Ala Arg Arg Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met 
305 310 315 320 

Gln Ala Ile Asn Asn Glu Arg Lys Ala Lys Lys Val Leu Gly Ile Val 
325 330 335 

Phe Phe Val Phe Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile 
340 345 350 

Met Ala Val Ile Cys Lys Glu Ser Cys Asn Glu Asp Val Ile Gly Ala 
355 360 365 

Leu Leu Asn Val Phe Val Trp Ile Gly Tyr Leu Ser Ser Ala Val Asn 
370 375 380 

Pro Leu Val Tyr Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser 
385 390 395 400 

Asn Tyr Leu Arg Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg 
405 410 415 

Gln Ile Pro Arg Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn 
420 425 430 

Val Asn Ile Tyr Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser 
435 440 445 

Asp Asn Glu Pro Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro 
450 455 460 

Val Asn Pro Ser Ser Val Val Ser Glu Arg Ile Ser Ser Val 
465 470 475 